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GENES ENCODING ENZYMES FOR LIGNIN BIOSYNTHESIS 
AND USES THEREOF 

5 

TECHNICAL FIELD 

The present invention relates generally to plant molecular biology. More 
specifically, it relates to nucleic acids and methods for modifying the lignin content in 
plants. 

10 BACKGROUND OF THE INVENTION 

Differences in plant cell wall composition account for much of the 
variation in chemical reactivity, mechanical strength, and energy content of plant 
material. In turn, differences in chemical and mechanical properties of plant material 
greatly impact the utilization of plant biomass by agriculture and industry. One 

15 abundant component of many types of plant cells, and one which has garnered 
increasing attention because of its importance in plant utilization, are lignins. 

lignins are a class of complex heterpolymers associated with the 
polysaccharide components of the wall in specific plant cells. Lignins play an essential 
role in providing rigidity, compressive strength, and structural support to plant tissues. 

20 They also render cell walls hydrophobic allowing the conduction of water and solutes. 
Reflecting their importance, lignins represent the second most abundant organic 
compound on Earth after cellulose accounting for approximately 25% of plant biomass. 
Lignins result from the oxidative coupling of three monomers: coumaryl, coniferyl, and 
sinapyl alcohols. Variability in lignin structure is dependent, in part, upon the relative 

25 proportion of the three constitutive monomers. 

The biosynthesis of lignins proceeds from phenylalanine through the 
phenylpropanoid pathway to the cinnamoyl CoAs which are the general precursors of a 
wide range of phenolic compounds. The enzymes involved in this pathway are 
phenylalanine ammonia-lyase (PAL), cinnamate-4-hydroxylase (C4H), 4-coumarate-3- 

30 hydroxylase (C3H), O-methyltransferase (OMT), ferulate-5-hydroxylase (F5H), 
caffeoyl-CoA 3-O-methyltransf erase (CCoA-OMT), and 4-coumarate:CoA ligase 
(4CL). Whetten and Sederoff, The Plant Cell, 7: 1001-1013 (1995); Boudet and Grima- 
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Pettenati, Molecular Breeding, 2:25-39 (1996). 

The lignin specific pathway channels cinnamoyl CoAs towards the 
synthesis of monolignols and lignins. This pathway involves two reductive enzymes 
that convert the hydroxycinnamoyl-CoA esters into monolignols: cinnamoyl-CoA 
5 reductase (CCR), and cinnamyl alcohol dehydrogenase (CAD). 

While lignins are a vital component in terrestrial vascular plants, they 
often pose an obstacle to the utilization of plant biomass. For example, in the pulp and 
paper industry lignins have to be separated from cellulose by an expensive and polluting 
process. Lignin content also limits the digestability of crops consumed by livestock. 

10 While reduction of lignin content for such applications is generally desirable, increasing 
lignin content in plant material intended as a chemical feedstock for production of 
phenolics, for use as a fuel source, or for improvement in agronomically desirable 
properties (e.g., standability) is also advantageous. Accordingly, what is needed in the 
art is the ability to modulate lignin content in plants. The present invention addresses 

IS these and other needs. 

SUMMARY OF THE INVENTION 

Generally, it is the object of the present invention to provide nucleic 
acids and proteins relating to lignin biosynthesis. It is an object of the present invention 
to provide antigenic fragments of the proteins of the present invention. It is an object 

20 of the present invention to provide transgenic plants comprising the nucleic acids of the 
present invention. Additionally, it is an object of the present invention to provide 
methods for modulating, in a transgenic plant, the expression of the nucleic acids of the 
present invention. 

Therefore, in one aspect, the present invention relates to an isolated 
25 nucleic acid comprising a member selected from the group consisting of (a) a 
polynucleotide having at least 60% identity to a polynucleotide encoding a polypeptide 
selected from the group consisting of SEQ ID NOS: -18 and 73-75, wherein the 
polypeptide when presented as an immunogen elicits the production of an antibody 
which is specifically reactive to the polypeptide; (b) a polynucleotide which is 
30 complementary to the polynucleotide of (a); and (c) a polynucleotide comprising at least 
25 contiguous nucleotides from a polynucleotide of (a) or (b). In some embodiments, 
the polynucleotide has a sequence selected from the group consisting of SEQ ID NOS: 
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19-36 and 76-78. The isolated nucleic acid can be DNA. 

In another aspect, the present invention relates to recombinant expression 
cassettes, comprising a nucleic acid as described, supra, operably linked to a promoter. 
In some embodiments, the nucleic acid is operably linked in antisense orientation to the 
5 promoter. 

In another aspect, the present invention is directed to a host cell 
transfected with the recombinant expression cassette as described, supra. In some 
embodiments, the host cell is a sorghum (Sorghum bicolor) or maize (Zea mays) cell. 

In a further aspect, the present invention relates to an isolated protein 
10 comprising a polypeptide of at least 10 contiguous amino acids encoded by the isolated 
nucleic acid referred to, supra. In some embodiments, the polypeptide has a sequence 
selected from the group consisting of SEQ ID NOS:l-18 and 73-75. 

In another aspect, the present invention relates to an isolated nucleic acid 
comprising a polynucleotide of at least 25 nucleotides in length which selectively 
15 hybridizes under stringent conditions to a nucleic acid selected from the group 
consisting of SEQ ID NOS: 19-36 and 76-78, or a complement thereof. In some 
embodiments, the isolated nucleic acid is operably linked to a promoter. 

In yet another aspect, the present invention relates to an isolated nucleic 
acid comprising a polynucleotide, the polynucleotide having at least 80% sequence 
20 identity to an identical length of a nucleic acid selected from the group consisting of 
SEQ ID NOS: 19-36 and 76-78 or a complement thereof. 

In another aspect, the present invention relates to an isolated nucleic acid 
comprising a polynucleotide having a sequence of a nucleic acid amplified from a Zea 
mays nucleic acid library using the primers selected from the group consisting of SEQ 
25 ID NOS: 37-72 and 79-84, or complements thereof. In some embodiments, the nucleic 
acid library is a cDNA library. 

In another aspect, the present invention relates to a recombinant 
expression cassette comprising a nucleic acid amplified from a library as referred to 
supra, wherein the nucleic acid is operably linked to a promoter. In some 
30 embodiments, the present invention relates to a host cell transfected with this 
recombinant expression cassette In some embodiments, the present invention relates to 
a protein of the present invention which is produced from this host cell. 
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In an additional aspect, the present invention is directed to an isolated 
nucleic acid comprising a polynucleotide encoding a polypeptide wherein: (a) the 
polypeptide comprises at least 10 contiguous amino acid residues from a first 
polypeptide selected from the group consisting of SEQ ID NOS:l-18 and 73-75, 
5 wherein said polypeptide, when presented as an immunogen, elicits the production of an 
antibody which specifically binds to said first polypeptide; (b) the polypeptide does not 
bind to antisera raised against the first polypeptide which has been fully immunosorbed 
with the first polypeptide; (c) the polypeptide has a molecular weight in non- 
glycosylated form within 10% of the first polypeptide. 
10 In a further aspect, the present invention relates to a heterologous 

promoter operably linked to a non-isolated polynucleotide of the present invention, 
wherein the polypeptide is encoded by a nucleic acid amplified from a nucleic acid 
library. 

In yet another aspect, the present invention relates to a transgenic plant 
15 comprising a recombinant expression cassette comprising a plant promoter operably 
linked to any of the isolated nucleic acids of the present invention. In some 
embodiments, the transgenic plant is Zea mays. The present invention also provides 
transgenic seed from the transgenic plant. 

In a further aspect, the present invention relates to a method of 
20 modulating expression of the genes encoding the proteins of the present invention in a 
plant, comprising the steps of (a) transforming a plant cell with a recombinant 
expression cassette comprising a polynucleotide of the present invention operably linked 
to a promoter; (b) growing the plant cell under plant growing conditions; and (c) 
inducing expression of the polynucleotide for a time sufficient to modulate expression 
25 of the genes in the plant. In some embodiments, the plant is maize. Expression of the 
genes encoding the proteins of the present invention can be increased or decreased 
relative to a non-transformed control plant. 
Definitions 

Units, prefixes, and symbols may be denoted in their SI accepted form. 
30 Unless otherwise indicated, nucleic acids are written left to right in 5' to 3 1 orientation; 
amino acid sequences are written left to right in amino to carboxy orientation, 
respectively. Numeric ranges are inclusive of the numbers defining the range. Amino 
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acids may be referred to herein by either their commonly known three letter symbols or 
by the one-letter symbols recommended by the IUPAC-IUB Biochemical Nomenclature 
Commission. Nucleotides, likewise, may be referred to by their commonly accepted 
single-letter codes. The terms defined below are more fully defined by reference to the 

5 specification as a whole. 

By "amplified" is meant the construction of multiple copies of a nucleic 
acid sequence or multiple copies complementary to the nucleic acid sequence using at 
least one of the nucleic acid sequences as a template. Amplification systems include the 
polymerase chain reaction (PCR) system, ligase chain reaction (LCR) system, nucleic 

10 acid sequence based amplification (NASBA, Cangene, Mississauga, Ontario), Q-Beta 
Replicase systems, transcription-based amplification system (TAS), and strand 
displacement amplification (SDA). See, e.g., Diagnostic Molecular Microbiology: 
Principles and Applications, D. H. Persing et al., Ed., American Society for 
Microbiology, Washington, D.C. (1993). The product of amplification is termed an 

IS amplicon. 

The term "antibody" includes reference to antigen binding forms of 
antibodies (e.g., Fab, F(ab)2). The term "antibody" frequently refers to a polypeptide 
substantially encoded by an immunoglobulin gene or immunoglobulin genes, or 
fragments thereof which specifically bind and recognize an analyte (antigen). 

20 However, while various antibody fragments can be defined in terms of the digestion of 
an intact antibody, one of skill will appreciate that such fragments may be synthesized 
de novo either chemically or by utilizing recombinant DNA methodology. Thus, the 
term antibody, as used herein, also includes antibody fragments such as single chain 
Fv, chimeric antibodies (i.e., comprising constant and variable regions from different 

25 species), humanized antibodies (i.e., comprising a complementarity determining region 
(CDR) from a non-human source) and heteroconjugate antibodies (e.g., bispecific 
antibodies). 

The term "antigen" includes reference to a substance to which an 
antibody can be generated and/or to which the antibody is specifically immunoreactive. 
30 The specific immunoreactive sites within the antigen are known as epitopes or antigenic 
determinants. These epitopes can be a linear array of monomers in a polymeric 
composition - such as amino acids in a protein - or consist of or comprise a more 
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complex secondary or tertiary structure. Those of skill will recognize that all 
immunogens (i.e., substance capable of eliciting an immune response) are antigens; 
however some antigens, such as haptens, are not immunogens but may be made 
immunogenic by coupling to a carrier molecule. An antibody immunologically reactive 

5 with a particular antigen can be generated in vivo or by recombinant methods such as 
selection of libraries of recombinant antibodies in phage or similar vectors. See, e.g., 
Huse et aL, Science 246: 1275-1281 (1989); and Ward, et aL, Nature 341: 544-546 
(1989); and Vaughan et al., Nature Biotech. 14: 309-314 (1996). 

As used herein, " an ti sense orientation" includes reference to a duplex 

10 polynucleotide sequence which is operably linked to a promoter in an orientation where 
the antisense strand is transcribed. The antisense strand is sufficiently complementary 
to an endogenous transcription product such that translation of the endogenous 
transcription product is often inhibited. 

As used herein, "chromosomal region" includes reference to a length of 

15 chromosome which may be measured by reference to the linear segment of DNA which 
it comprises. The chromosomal region can be defined by reference to two unique DNA 
sequences, i.e., markers. 

The term "conservatively modified variants" applies to both amino acid 
and nucleic acid sequences. With respect to particular nucleic acid sequences, 

20 conservatively modified variants refers to those nucleic acids which encode identical or 
conservatively modified variants of the amino acid sequences. Because of the 
degeneracy of the genetic code, a large number of functionally identical nucleic acids 
encode any given protein. For instance, the codons GCA, GCC, GCG and GCU all 
encode the amino acid alanine. Thus, at every position where an alanine is specified by 

25 a codon, the codon can be altered to any of the corresponding codons described without 
altering the encoded polypeptide. Such nucleic acid variations are "silent variations" 
and represent one species of conservatively modified variation. Every nucleic acid 
sequence herein which encodes a polypeptide also describes every possible silent 
variation of the nucleic acid. One of ordinary skill will recognize that each codon in a 

30 nucleic acid (except AUG, which is ordinarily the only codon for methionine) can be 
modified to yield a functionally identical molecule. Accordingly, each silent variation 
of a nucleic acid which encodes a polypeptide of the present invention is implicit in 
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each described polypeptide sequence and incorporated herein by reference. 

As to amino acid sequences, one of skill will recognize that individual 
substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein 
sequence which alters, adds or deletes a single amino acid or a small percentage of 
5 amino acids in the encoded sequence is a "conservatively modified variant" where the 
alteration results in the substitution of an amino acid with a chemically similar amino 
acid. Thus, any number of amino acid residues selected from the group of integers 
consisting of from 1 to IS can be so altered. Thus, for example, 1, 2, 3, 4, 5, 7, or 10 
alterations can be made. Conservatively modified variants typically provide similar 

10 biological activity as the unmodified polypeptide sequence from which they are derived. 
For example, substrate specificity, enzyme activity, or ligand/receptor binding is 
generally at least 30%, 40%, 50%, 60%, 70%, 80%, or 90% of the native protein for 
it's native substrate. Conservative substitution tables providing functionally similar 
amino acids are well known in the art. 

15 The following six groups each contain amino acids that are conservative 

substitutions for one another: 

1) Alanine (A), Serine (S), Threonine (T); 

2) Aspartic acid (D), Glutamic acid (E); 

3) Asparagine (N), Glutamine (Q); 
20 4) Arginine (R), Lysine (K); 

5) Isoleucine (I), Leucine (L), Methionine (M), Valine (V); and 

6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W). 

See also, Creighton (1984) Proteins W.H. Freeman and Company. 

By "encoding" or "encoded", with respect to a specified nucleic acid, is 

25 meant comprising the information for translation into the specified protein. A nucleic 
acid encoding a protein may comprise non-translated sequences (e.g., introns) within 
translated regions of the nucleic acid, or may lack such intervening non-translated 
sequences (e.g., as in cDNA). The information by which a protein is encoded is 
specified by the use of codons. Typically, the amino acid sequence is encoded by the 

30 nucleic acid using the "universal" genetic code. However, variants of the universal 
code, such as is present in some plant, animal, and fungal mitochondria, the bacterium 
Mycoplasma capricolum (Proc. Natl. Acad. ScL (USA), 82: 2306-2309 (1985)), or the 
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ciliate Macronucleus, may be used when the nucleic acid is expressed using these 
organisms. 

When the nucleic acid is prepared or altered synthetically, advantage can 
be taken of known codon preferences of the intended host where the nucleic acid is to 

5 be expressed. For example, although nucleic acid sequences of the present invention 
may be expressed in both monocotyledonous and dicotyledonous plant species, 
sequences can be modified to account for the specific codon preferences and GC content 
preferences of monocotyledons or dicotyledons as these preferences have been shown to 
differ (Murray et al. Nucl. Acids Res. 17: 477-498 (1989)). Thus, the maize preferred 

10 codon for a particular amino acid may be derived from known gene sequences from 
maize. Maize codon usage for 28 genes from maize plants are listed in Table 4 of 
Murray et al., supra. 

As used herein "full-length sequence" in reference to a specified 
polynucleotide or its encoded protein means having the entire amino acid sequence of, a 

15 native (non-synthetic), endogenous, catalytically active form of the specified protein. A 
full-length sequence can be determined by size comparison relative to a control which is 
a native (non-synthetic) endogenous cellular form of the specified nucleic acid or 
protein. Methods to determine whether a sequence is full-length are well known in the 
art including such exemplary techniques as northern or western blots, primer extension, 

20 SI protection, and ribonuclease protection. See, e.g., Plant Molecular Biology: A 
Laboratory Manual, Clark, Ed., Springer-Verlag, Berlin (1997). Comparison to 
known full-length homologous (orthologous and/or paralogous) sequences can also be 
used to identify full-length sequences of the present invention. Additionally, consensus 
sequences typically present at the 5* and 3' untranslated regions of mRNA aid in the 

25 identification of a polynucleotide as full-length. For example, the consensus sequence 
ANNNNAIIQG, where the underlined codon represents the N-terminal methionine, 
aids in determining whether the polynucleotide has a complete 5' end. Consensus 
sequences at the 3' end, such as polyadenylation sequences, aid in determining whether 
the polynucleotide has a complete 3' end. 

30 As used herein, "heterologous" in reference to a nucleic acid is a nucleic 

acid that originates from a foreign species, or, if from the same species, is substantially 
modified from its native form in composition and/or genomic locus by deliberate 



WO 99/10498 



PCT/US98/17519 



human intervention. For example, a promoter operably linked to a heterologous 
structural gene is from a species different from that from which the structural gene was 
derived, or, if from the same species, one or both are substantially modified from their 
original form. A heterologous protein may originate from a foreign species or, if from 
5 the same species, is substantially modified from its original form by deliberate human 
intervention. 

By "host cell" is meant a cell which contains a vector and supports the 
replication and/or expression of the expression vector. Host cells may be prokaryotic 
cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or 

10 mammalian cells. Preferably, host cells are monocotyledonous or dicotyledonous plant 
cells. A particularly preferred monocotyledonous host cell is a maize host cell. 

The term "hybridization complex" includes reference to a duplex nucleic 
acid structure formed by two single-stranded nucleic acid sequences selectively 
hybridized with each other. 

15 By "immunologically reactive conditions" or "immunoreactive 

conditions" is meant conditions which allow an antibody, generated to a particular 
epitope, to bind to that epitope to a detectably greater degree (e.g., at least 2-fold over 
background) than the antibody binds to substantially all other epitopes in a reaction 
mixture comprising the particular epitope. Immunologically reactive conditions are 

20 dependent upon the format of the antibody binding reaction and typically are those 
utilized in immunoassay protocols. See Harlow and Lane, Antibodies, A Laboratory 
Manual, Cold Spring Harbor Publications, New York (1988), for a description of 
immunoassay formats and conditions. 

The term "introduced" in the context of inserting a nucleic acid into a 

25 cell, means "transfection" or "transformation" or "transduction" and includes reference 
to the incorporation of a nucleic acid into a eukaryotic or prokaryotic cell where the 
nucleic acid may be incorporated into the genome of the cell (e.g., chromosome, 
plasmid, plastid or mitochondrial DNA), converted into an autonomous replicon, or 
transiently expressed (e.g., transfected mRNA). 

30 The terms "isolated" refers to material, such as a nucleic acid or a 

protein, which is: (1) substantially or essentially free from components which normally 
accompany or interact with it as found in its naturally occurring environment. The 



WO 99/10498 



PCT/US98/17519 

- 10 - 



isolated material optionally comprises material not found with the material in its natural 
environment; or (2) if the material is in its natural environment, the material has been 
synthetically (non-naturally) altered by deliberate human intervention to a composition 
and/or placed at a locus in the cell (e.g., genome or subcellular organelle) not native to 

5 a material found in that environment. The alteration to yield the synthetic material can 
be performed on the material within or removed from its natural state. For example, a 
naturally occurring nucleic acid becomes an isolated nucleic acid if it is altered, or if it 
is transcribed from DNA which has been altered, by non-natural, synthetic (i.e., "man- 
made") methods performed within the cell from which it originates. See, e.g., 

10 Compounds and Methods for Site Directed Mutagenesis in Eukaryotic Cells, Kmiec, 
U.S. Patent No. 5,565,350; In Vivo Homologous Sequence Targeting in Eukaryotic 
Cells; Zarling et a/., PCT/US93/03868. Likewise, a naturally occurring nucleic acid 
(e.g., a promoter) becomes isolated if it is introduced by non-naturally occurring means 
to a locus of the genome not native to that nucleic acid. Nucleic acids which are 

15 "isolated" as defined herein, are also referred to as "heterologous" nucleic acids. 

Unless otherwise stated, the term "lignin biosynthesis nucleic acid" 
means a nucleic acid comprising a polynucleotide ("lignin biosynthesis polynucleotide") 
encoding a lignin biosynthesis polypeptide. A "lignin biosynthesis gene" refers to a 
non-heterologous genomic form of a full-length lignin biosynthesis polynucleotide. 

20 As used herein, "localized within the chromosomal region defined by and 

including" with respect to particular markers includes reference to a contiguous length 
of a chromosome delimited by and including the stated markers. 

As used herein, "marker" includes reference to a locus on a chromosome 
that serves to identify a unique position on the chromosome. A "polymorphic marker" 

25 includes reference to a marker which appears in multiple forms (alleles) such that 
different forms of the marker, when they are present in a homologous pair, allow 
transmission of each of the chromosomes in that pair to be followed. A genotype may 
be defined by use of one or a plurality of markers. 

As used herein, "nucleic acid" includes reference to a 

30 deoxyribonucleotide or ribonucleotide polymer in either single- or double-stranded 
form, and unless otherwise limited, encompasses known analogues having the essential 
nature of natural nucleotides in that they hybridize to single-stranded nucleic acids in a 
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manner similar to naturally occurring nucleotides (e.g., peptide nucleic acids). 

By "nucleic acid library " is meant a collection of isolated DNA or RNA 
molecules which comprise and substantially represent the entire transcribed fraction of a 
genome of a specified organism. Construction of exemplary nucleic acid libraries, such 

5 as genomic and cDNA libraries, is taught in standard molecular biology references such 
as Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in 
Enzymology, Vol. 152, Academic Press, Inc., San Diego, CA (Berger); Sambrook et 
ah, Molecular Cloning - A Laboratory Manual, 2nd ed., Vol. 1-3 (1989); and Current 
Protocols in Molecular Biology, F.M. Ausubel et al f Eds., Current Protocols, a joint 

10 venture between Greene Publishing Associates, Inc. and John Wiley & Sons, Inc. (1994 
Supplement). 

As used herein "operably linked" includes reference to a functional 
linkage between a promoter and a second sequence, wherein the promoter sequence 
initiates and mediates transcription of the DNA sequence corresponding to the second 

15 sequence. Generally, operably linked means that the nucleic acid sequences being 
linked are contiguous and, where necessary to join two protein coding regions, 
contiguous and in the same reading frame. 

As used herein, the term "plant" includes reference to whole plants, plant 
organs (e.g., leaves, stems, roots, etc.), seeds and plant cells and progeny of same. 

20 Plant cell, as used herein includes, without limitation, seeds suspension cultures, 
embryos, meristematic regions, callus tissue, leaves, roots, shoots, 
gametophytes, sporophytes, pollen, and microspores. The class of plants which can be 
used in the methods of the invention is generally as broad as the class of higher plants 
amenable to transformation techniques, including both monocotyledonous and 

25 dicotyledonous plants. A particularly preferred plant is Zea mays. 

As used herein, "polynucleotide" includes reference to a 
deoxyribopolynucleotide, ribopolynucleotide, or analogs thereof that have the essential 
nature of a natural ribonucleotide in that they hybridize, under stringent hybridization 
conditions, to substantially the same nucleotide sequence as naturally occurring 

30 nucleotides and/or allow translation into the same amino acid(s) as the naturally 
occurring nucleotide(s). A polynucleotide can be full-length or a subsequence of a 
native or heterologous structural or regulatory gene. Unless otherwise indicated, the 
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term includes reference to the specified sequence as well as the complementary 
sequence thereof. Thus, DNAs or RNAs with backbones modified for stability or for other 
reasons are "polynucleotides" as that term is intended herein. Moreover, DNAs or RNAs 
comprising unusual bases, such as inosine, or modified bases, such as tritylated bases, to 

5 name just two examples, are polynucleotides as the term is used herein. It will be 
appreciated that a great variety of modifications have been made to DNA and RNA that 
serve many useful purposes known to those of skill in the art Hie term polynucleotide as it 
is employed herein embraces such chemically, enzymatically or metabolically modified 
forms of polynucleotides, as well as the chemical forms of DNA and RNA characteristic of 

10 viruses and cells, including inter alia, simple and complex cells. 



interchangeably herein to refer to a polymer of amino acid residues. The terms apply 
to amino acid polymers in which one or more amino acid residue is an artificial 
chemical analogue of a corresponding naturally occurring amino acid, as well as to 

15 naturally occurring amino acid polymers. The essential nature of such analogues of 
naturally occurring amino acids is that, when incorporated into a protein, that protein is 
specifically reactive to antibodies elicited to the same protein but consisting entirely of 
naturally occurring amino acids. The terms "polypeptide", "peptide" and "protein" are 
also inclusive of modifications including, but not limited to, glycosylation, lipid attachment, 

20 sulfation, gamma-carboxylation of glutamic acid residues, hydroxylation and ADP- 
ribosylation. Exemplary modifications are described in most basic texts, such as, Proteins - 
Structure and Molecular Properties, 2nd ed., T. E. Credghton, W. H. Freeman and 
Company, New York (1993). Many detailed reviews are available on this subject, such as, 
for example, those provided by Wold, F., Posttranslational Protein Modifications: 

25 Perspectives and Prospects, pp. 1-12 in Posttranslational Covalent Modification of Proteins, 
B. C. Johnson, Ed., Academic Press, New York (1983); Softer et a/., Meth. EnzymoL 
182: 626-646 (1990) and Rattan et al. 9 Protein Synthesis: Posttranslational Modifications 
and Aging, Ann. N.Y. Acad. Sci. 663: 48-62 (1992). It will be appreciated, as is well 
known and as noted above, that polypeptides are not always entirely linear. For instance, 

30 polypeptides may be branched as a result of ubiquitination, and they may be circular, with 
or without brandling, generally as a result of posttranslation events, including natural 
processing event and events brought about by human manipulation which do not occur 



The terms "polypeptide", "peptide" and "protein" are used 
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naturally. Circular, branched and branched circular polypeptides may be synthesized by 
non-translation natural process and by entirely synthetic methods, as well. Modifications 
can occur anywhere in a polypeptide, including the peptide backbone, the amino acid side- 
chains and the amino or caiboxyl termini. In fact, blockage of the amino or carboxyl group 
5 in a polypeptide, or both, by a covalent modification, is common in naturally occurring and 
synthetic polypeptides and such modifications may be present in polypeptides of the present 
invention, as well. For instance, the amino terminal residue of polypeptides made in E. coli 
or other cells, prior to proteolytic processing, almost invariably will be N- 
formylmethionine. During post-translational modification of the peptide, a methionine 

10 residue at the NH 2 -terminus may be deleted. Accordingly, this invention contemplates 
the use of both the methionine-containing and the methionineless amino terminal 
variants of the protein of the invention. In general, as used herein, the term polypeptide 
encompasses all such modifications, particularly those that are present in polypeptides 
synthesized by expressing a polynucleotide in a host cell. 

15 As used herein "promoter" includes reference to a region of DNA 

upstream from the start of transcription and involved in recognition and binding of 
RNA polymerase and other proteins to initiate transcription. A "plant promoter" is a 
promoter capable of initiating transcription in plant cells. Exemplary plant promoters 
include, but are not limited to, those that are obtained from plants, plant viruses, and 

20 bacteria which comprise genes expressed in plant cells such Agrobacterium or 
Rhizobium. Examples of promoters under developmental control include promoters that 
preferentially initiate transcription in certain tissues, such as leaves, roots, seeds, fibers, 
xylem vessels, tracheids, or sclerenchyma. Such promoters are referred to as "tissue 
preferred". Promoters which initiate transcription only in certain tissue are referred to 

25 as "tissue specific". A "cell type" specific promoter primarily drives expression in 
certain cell types in one or more organs, for example, vascular cells in roots or leaves. 
An "inducible" promoter is a promoter which is under environmental control. 
Examples of environmental conditions that may effect transcription by inducible 
promoters include anaerobic conditions or the presence of light. Tissue specific, tissue 

30 preferred, cell type specific, and inducible promoters constitute the class of "non- 
constitutive" promoters. A "constitutive" promoter is a promoter which is active under 
most environmental conditions. 
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The term H lignin biosynthesis polypeptide" refer to one or more amino 
acid sequences, in glycosylated or non-glycosylated form, involved in the lignin 
biosynthesis pathway. The term is also inclusive of fragments, variants, homologs, 
alleles or precursors (e.g., preproproteins or pioproteins) thereof. A "lignin 
5 biosynthesis protein" comprises a lignin biosynthesis polypeptide. 

As used herein "recombinant" includes reference to a cell or vector, that 
has been modified by the introduction of a heterologous nucleic acid or that the cell is 
derived from a cell so modified. Thus, for example, recombinant cells express genes 
that are not found in identical form within the native (non-recombinant) form of the cell 
10 or express native genes that are otherwise abnormally expressed, under expressed or not 
expressed at all as a result of deliberate human intervention. The term "recombinant" 
as used herein does not encompass the alteration of the cell or vector by naturally 
occurring events (e.g., spontaneous mutation, natural 

transformation/transduction/transposition) such as those occurring without deliberate 
15 human intervention. 

As used herein, a "recombinant expression cassette" is a nucleic acid 
construct, generated recombinantly or synthetically, with a series of specified nucleic 
acid elements which permit transcription of a particular nucleic acid in a target cell. 
The recombinant expression cassette can be incorporated into a plasmid, chromosome, 

20 mitochondrial DNA, plastid DNA, virus, or nucleic acid fragment. Typically, the 
recombinant expression cassette portion of an expression vector includes, among other 
sequences, a nucleic acid to be transcribed, and a promoter. 

The term "residue" or "amino acid residue" or "amino acid" are used 
interchangeably herein to refer to an amino acid that is incorporated into a protein, 

25 polypeptide, or peptide (collectively "protein"). The amino acid may be a naturally 
occurring amino acid and, unless otherwise limited, may encompass known analogs of 
natural amino acids that can function in a similar manner as naturally occurring amino 
acids. 

The term "selectively hybridizes" includes reference to hybridization, 
30 under stringent hybridization conditions, of a nucleic acid sequence to a specified 
nucleic acid target sequence to a detectably greater degree (e.g., at least 2-fold over 
background) than its hybridization to non-target nucleic acid sequences and to the 
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substantial exclusion of non-target nucleic acids. Selectively hybridizing sequences 
typically have about at least 80% sequence identity, preferably 90% sequence identity, 
and most preferably 100% sequence identity <i.e., complementary) with each other. 

The term "specifically reactive", includes reference to a binding reaction 
5 between an antibody and a protein having an epitope recognized by the antigen binding 
site of the antibody. This binding reaction is determinative of the presence of a protein 
having the recognized epitope amongst the presence of a heterogeneous population of 
proteins and other biologies. Thus, under designated immunoassay conditions, the 
specified antibodies bind to an analyte having the recognized epitope to a substantially 
greater degree (e.g., at least 2-fold over background) than to substantially all other 
analytes lacking the epitope which are present in the sample. 

Specific binding to an antibody under such conditions may require an 
antibody that is selected for its specificity for a particular protein. For example, 
antibodies raised to the polypeptides of the present invention can be selected from to 
obtain antibodies specifically reactive with polypeptides of the present invention. The 
proteins used as immunogens can be in native conformation or denatured so as to 
provide a linear epitope. 

A variety of immunoassay formats may be used to select antibodies 
specifically reactive with a particular protein (or other analyte). For example, solid- 
phase ELISA immunoassays are routinely used to select monoclonal antibodies 
specifically immunoreactive with a protein. See Harlow and Lane, Antibodies, A 
laboratory Manual, Cold Spring Harbor Publications, New York (1988), for a 
description of immunoassay formats and conditions that can be used to determine 
selective reactivity. 

The terms "stringent conditions" or "stringent hybridization conditions- 
includes reference to conditions under which a probe will hybridize to its target 
sequence, to a detectably greater degree than other sequences (e.g., at least 2-fold over 
background). Stringent conditions are sequence-dependent and will be different in 
different circumstances. By controlling the stringency of the hybridization and/or 
washing conditions, target sequences can be identified which are 100% complementary 
to the probe (homologous probing). Alternatively, stringency conditions can be 
adjusted to allow some mismatching in sequences so that lower degrees of similarity are 
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detected (heterologous probing). Generally, a probe is less than about 1000 nucleotides 
in length, preferably less than 500 nucleotides in length. 

Typically, stringent conditions will be those in which the salt 
concentration is less than about 1.5 M Na ion, typically about 0.01 to 1.0 M Na ion 

5 concentration (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 
30°C for short probes {e.g., 10 to 50 nucleotides) and at least about 60°C for long 
probes (e.g., greater than 50 nucleotides). Stringent conditions may also be achieved 
with the addition of destabilizing agents such as formamide. Exemplary low stringency 
conditions include hybridization with a buffer solution of 30 to 35% formamide, 1 M 

0 NaCl, 1% SDS (sodium dodecyl sulphate) at 37°C, and a wash in IX to 2X SSC (20X 
SSC = 3.0 M NaCl/0.3 M trisodium citrate) at 50 to 55°C. Exemplary moderate 
stringency conditions include hybridization in 40 to 45% formamide, 1 M NaCl, 1% 
SDS at 37°C, and a wash in 0.5X to IX SSC at 55 to 60°C. Exemplary high 
stringency conditions include hybridization in 50% formamide, 1 M NaCl, 1% SDS at 

5 37°C, and a wash in 0. IX SSC at 60 to 65 °C. 

Specificity is typically the function of post-hybridization washes, the 
critical factors being the ionic strength and temperature of the final wash solution. For 
DNA-DNA hybrids, the T m can be approximated from the equation of Meinkoth and 
Wahl, Anal. Biochem., 138:267-284 (1984): T m - 81.5 °C + 16.6 (log M) + 0.41 

0 (%GC) - 0.61 (% form) - 500/L; where M is the molarity of monovalent cations, %GC 
is the percentage of guanosine and cytosine nucleotides in the DNA, % form is the 
percentage of formamide in the hybridization solution, and L is the length of the hybrid 
in base pairs. The T m is the temperature (under defined ionic strength and pH) at which 
50% of a complementary target sequence hybridizes to a perfectly matched probe. T m 

5 is reduced by about 1 °C for each 1% of mismatching; thus, T m , hybridization and/or 
wash conditions can be adjusted to hybridize to sequences of the desired identity. For 
example, if sequences with ^90% identity are sought, the T m can be decreased 10 °C. 
Generally, stringent conditions are selected to be about 5 °C lower than the thermal 
melting point (TJ for the specific sequence and its complement at a defined ionic 

[) strength and pH, However, severely stringent conditions can utilize a hybridization 
and/or wash at 1, 2, 3, or 4 °C lower than the thermal melting point (TJ; moderately 
stringent conditions can utilize a hybridization and/or wash at 6, 7, 8, 9, or 10 °C 
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lower than the thermal melting point (TJ; low stringency conditions can utilize a 
hybridization and/or wash at 11, 12, 13, 14, 15, or 20 "C lower than the thermal 
melting point (TJ. Using the equation, hybridization and wash compositions, and 
desired T m , those of ordinary skill will understand that variations in the stringency of 
5 hybridization and/or wash solutions are inherently described. If the desired degree of 
mismatching results in a T m of less than 45 °C (aqueous solution) or 32 °C (formamide 
solution) it is preferred to increase the SSC concentration so that a higher temperature 
can be used. An extensive guide to the hybridization of nucleic acids is found in 
Tijssen, Laboratory Techniques in Biochemistry and Molecular Biology-Hybridization 

10 with Nucleic Acid Probes. Part I, Chapter 2 "Overview of principles of hybridization 
and the strategy of nucleic acid probe assays", Elsevier, New York (1993); and Current 
Protocols in Molecular Biology, Chapter 2, Ausubel, et al., Eds., Greene Publishing 
and Wiley-Interscience, New York (1995). 

As used herein, "transgenic plant" includes reference to a plant which 

15 comprises within its genome a heterologous polynucleotide. Generally, the 
heterologous polynucleotide is stably integrated within the genome such that the 
polynucleotide is passed on to successive generations. The heterologous polynucleotide 
may be integrated into the genome alone or as part of a recombinant expression 
cassette. "Transgenic" is used herein to include any cell, cell line, callus, tissue, plant 

20 part or plant, the genotype of which has been altered by the presence of heterologous 
nucleic acid including those transgenics initially so altered as well as those created by 
sexual crosses or asexual propagation from the initial transgenic. The term "transgenic" 
as used herein does not encompass the 

alteration of the genome (chromosomal or extra-chromosomal) by conventional plant 
25 breeding methods or by naturally occurring events such as random cross-fertilization, 

non-recombinant viral infection, non-recombinant bacterial transformation, non- 

recombinant transposition, or spontaneous mutation. 

As used herein, "vector" includes reference to a nucleic acid used in 

transfection of a host cell and into which can be inserted a polynucleotide. Vectors are 
30 often replicons. Expression vectors permit transcription of a nucleic acid inserted 

therein. 

The following terms are used to describe the sequence relationships 
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between two or more nucleic acids or polynucleotides: (a) "reference sequence", (b) 
"comparison window", (c) "sequence identity", (d) "percentage of sequence identity", 
and (e) "substantial identity", 

(a) As used herein, "reference sequence" is a defined sequence 
5 used as a basis for sequence comparison. A reference sequence may be a subset or the 

entirety of a specified sequence; for example, as a segment of a full-length cDNA or 
gene sequence, or the complete cDNA or gene sequence. 

(b) As used herein, "comparison window" means includes 
reference to a contiguous and specified segment of a polynucleotide sequence, wherein 

10 the polynucleotide sequence may be compared to a reference sequence and wherein the 
portion of the polynucleotide sequence in the comparison window may comprise 
additions or deletions (i.e., gaps) compared to the reference sequence (which does not 
comprise additions or deletions) for optimal alignment of the two sequences. 
Generally, the comparison window is at least 20 contiguous nucleotides in length, and 

15 optionally can be 30, 40, 50, 100, or longer. Those of skill in the art understand that 
to avoid a high similarity to a reference sequence due to inclusion of gaps in the 
polynucleotide sequence a gap penalty is typically introduced and is subtracted from the 
number of matches. 

Methods of alignment of sequences for comparison are well-known in the 

20 art. Optimal alignment of sequences for comparison may be conducted by the local 
homology algorithm of Smith and Waterman, Adv. Appl. Math. 2: 482 (1981); by the 
homology alignment algorithm of Needleman and Wunsch, J. Mol Biol. 48: 443 
(1970); by the search for similarity method of Pearson and Iipman, Proc. Natl. Acad. 
Sci. 85: 2444 (1988); by computerized implementations of these algorithms, including, 

25 but not limited to: CLUSTAL in the PC/Gene program by Intelligenetics, Mountain 
View, California, GAP, BESTFTT, BLAST, FASTA, and TFASTA in the Wisconsin 
Genetics Software Package, Genetics Computer Group (GCG), 575 Science Dr., 
Madison, Wisconsin, USA; the CLUSTAL program is well described by Higgins and 
Sharp, Gene 73: 237-244 (1988); Higgins and Sharp, CABIOS 5: 151-153 (1989); 

30 Corpet, et al., Nucleic Acids Research 16: 10881-90 (1988); Huang, et al., Computer 
Applications in the Biosciences 8: 155-65 (1992), and Pearson, et al., Methods in 
Molecular Biology 24: 307-331 (1994). The BLAST family of programs which can be 
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used for database similarity searches includes: BLASTN for nucleotide query sequences 
against nucleotide database sequences; BLASTX for nucleotide query sequences against 
protein database sequences; BLASTP for protein query sequences against protein 
database sequences; TBLASTN for protein query sequences against nucleotide database 
5 sequences; and TBLASTX for nucleotide query sequences against nucleotide database 
sequences. See, Current Protocols in Molecular Biology, Chapter 19, Ausubel, et aJ., 
Eds., Greene Publishing and Wiley-Interscience, New York (1995). 

Unless otherwise stated, sequence identity/similarity values provided 
herein refer to the value obtained using the BLAST 2.0.1 suite of programs using 

10 default parameters. Altschul et al. , Nucleic Acids Res. 25:3389-3402 (1997). 

As those of ordinary skill in the art will understand, BLAST searches 
assume that proteins can be modeled as random sequences. However, many real 
proteins comprise regions of nonrandom sequences which may be homopolymeric 
tracts, short-period repeats, or regions enriched in one or more amino acids. Such low- 

15 complexity regions may be aligned between unrelated proteins even though other 
regions of the protein are entirely dissimilar. A number of low-complexity filter 
programs can be employed to reduce such low-complexity alignments. For example, 
the SEG (Wooten and Federhen, Comput. Chem., 17:149-163 (1993)) and XNU 
(Claverie and States, Comput. Chem., 17:191-201 (1993)) low-complexity filters can be 

20 employed alone or in combination. 

(c) As used herein, "sequence identity" or "identity" in the 
context of two nucleic acid or polypeptide sequences includes reference to the residues 
in the two sequences which are the same when aligned for maximum correspondence 
over a specified comparison window. When percentage of sequence identity is used in 
reference to proteins it is recognized that residue positions which are not identical often 
differ by conservative amino acid substitutions, where amino acid residues are 
substituted for other amino acid residues with similar chemical properties (e.g. charge 
or hydrophobicity) and therefore do not change the functional properties of the 
molecule. Where sequences differ in conservative substitutions, the percent sequence 
identity may be adjusted upwards to correct for the conservative nature of the 
substitution. Sequences which differ by such conservative substitutions are said to have 
"sequence similarity" or "similarity". Means for making this adjustment are well- 
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known to those of skill in the art. Typically this involves scoring a conservative 
substitution as a partial rather than a full mismatch, thereby increasing the percentage 
sequence identity. Thus, for example, where an identical amino acid is given a score of 
1 and a non-conservative substitution is given a score of zero, a conservative 
substitution is given a score between zero and 1. The scoring of conservative 
substitutions is calculated, e.g., according to the algorithm of Meyers and Miller, 
Computer Applic. Biol. Sci., 4: 11-17 (1988) e.g., as implemented in the program 
PC/GENE (Intelligenetics, Mountain View, California, USA). 

(d) As used herein, "percentage of sequence identity" means the 
value determined by comparing two optimally aligned sequences over a comparison 
window, wherein the portion of the polynucleotide sequence in the comparison window 
may comprise additions or deletions (i.e., gaps) as compared to the reference sequence 
(which does not comprise additions or deletions) for optimal alignment of the two 
sequences. The percentage is calculated by deternuning the number of positions at 
which the identical nucleic acid base or amino acid residue occurs in both sequences to 
yield the number of matched positions, dividing the number of matched positions by the 
total number of positions in the window of comparison and multiplying the result by 
100 to yield the percentage of sequence identity. 

(e) (i) The term "substantial identity" of polynucleotide sequences 
means that a polynucleotide comprises a sequence that has at least 70% sequence 
identity, preferably at least 80%, more preferably at least 90% and most preferably at 
least 95%, compared to a reference sequence using one of the alignment programs 
described using standard parameters. One of skill will recognize that these values can 
be appropriately adjusted to determine corresponding identity of proteins encoded by 
two nucleotide sequences by taking into account codon degeneracy, amino acid 
similarity, reading frame positioning and the like. Substantial identity of amino acid 
sequences for these purposes normally means sequence identity of at least 60%, more 
preferably at least 70%, 80%, 90%, and most preferably at least 95%. 

Another indication that nucleotide sequences are substantially identical is 
if two molecules hybridize to each other under stringent conditions. However, nucleic 
acids which do not hybridize to each other under stringent conditions are still 
substantially identical if the polypeptides which they encode are substantially identical. 
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This may occur, e.g., when a copy of a nucleic acid is created using the maximum 
codon degeneracy permitted by the genetic code. One indication that two nucleic acid 
sequences are substantially identical is that the polypeptide which the first nucleic acid 
encodes is immunologically cross reactive with the polypeptide encoded by the second 
5 nucleic acid. 

(e) (ii) The terms "substantial identity" in the context of a peptide 
indicates that a peptide comprises a sequence with at least 70% sequence identity to a 
reference sequence, preferably 80%, more preferably 85%, most preferably at least 
90% or 95% sequence identity to the reference sequence over a specified comparison 

0 window. Preferably, optimal alignment is conducted using the homology alignment 
algorithm of Needleman and Wunsch, 7. MoL Biol 48: 443 (1970). An indication that 
two peptide sequences are substantially identical is that one peptide is immunologically 
reactive with antibodies raised against the second peptide. Thus, a peptide is 
substantially identical to a second peptide, for example, where the two peptides differ 

5 only by a conservative substitution. Peptides which are "substantially similar" share 
sequences as noted above except that residue positions which are not identical may 
differ by conservative amino acid changes. 

DETAILED DESCRIPTION OF THE INVENTION 
0 pygryi^ 

The present invention provides, inter alia, compositions and methods for 
modulating (i.e., increasing or decreasing) the total levels of proteins of the present 
invention and/or altering their ratios in plants. Thus, the present invention provides 
utility in such exemplary applications as improving the digestibility of fodder crops, 

5 increasing the value of plant material for pulp and paper production, improving the 
standability of crops, as well as for improving the utility of plant material where lignin 
content or composition is important, such as the use of plant lignins as a chemical 
feedstock, or the use of hyperlignified plant material for use as a fuel source. In 
particular, the polypeptides of the present invention can be expressed at times or in 

0 quantities which are not characteristic of non-recombinant plants. 

The present invention also provides isolated nucleic acid comprising 
polynucleotides of sufficient length and complementarity to a lignin biosynthesis gene to 
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use as probes or amplification primers in the detection, quantitation, or isolation of 
gene transcripts. For example, isolated nucleic acids of the present invention can be 
used as probes in detecting deficiencies in the level of mRNA in screenings for desired 
transgenic plants, for detecting mutations in the gene (e.g., substitutions, deletions, or 
5 additions), for monitoring upregulation of expression or changes in enzyme activity in 
screening assays of compounds, for detection of any number of allelic variants 
(polymorphisms) of the gene, or for use as molecular markers in plant breeding 
programs. The isolated nucleic acids of the present invention can also be used for 
recombinant expression of lignin biosynthesis polypeptides, or for use as immunogens 

10 in the preparation and/or screening of antibodies. The isolated nucleic acids of the 
present invention can also be employed for use in sense or antisense suppression of one 
or more lignin biosynthesis genes in a host cell, tissue, or plant. Attachment of 
chemical agents which bind, intercalate, cleave and/or crosslink to the isolated nucleic 
acids of the present invention can also be used to modulate transcription or translation. 

15 Further, using a primer specific to an insertion sequence (e.g., transposon) and a 
primer which specifically hybridizes to an isolated nucleic acid of the present invention, 
one can use nucleic acid amplification to identity insertion sequence inactivated lignin 
biosynthesis genes from a cDNA library prepared from insertion sequence mutagenized 
plants. Progeny seed from the plants comprising the desired inactivated gene can be 

20 grown to a plant to study the phenotypic changes characteristic of that inactivation. See, 
Tools to Determine the Function of Genes, 1995 Proceedings of the Fiftieth Annual 
Com and Sorghum Industry Research Conference, American Seed Trade Association, 
Washington, D.C., 1995. Additionally, non-translated 5' or 3* regions of the 
polynucleotides of the present invention can be used to modulate turnover of 

25 heterologous mRNAs and/or protein synthesis. Further, the codon preference 
characteristic of the polynucleotides of the present invention can be employed in 
heterologous sequences, or altered in homologous or heterologous sequences, to 
modulate translational level and/or rates. 

The present invention also provides isolated proteins comprising 

30 polypeptides including an amino acid sequence from the tignin biosynthesis 
polypeptides (e.g., preproenzyme, proenzyme, or enzymes) as disclosed herein. The 
present invention also provides proteins comprising at least one epitope from a tignin 
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biosynthesis polypeptide. The proteins of the present invention can be employed in 
assays for enzyme agonists or antagonists of enzyme function, or for use as 
immunogens or antigens to obtain antibodies specifically immunoreactive with a protein 
of the present invention. Such antibodies can be used in assays for expression levels, 
5 for identifying and/or isolating nucleic acids of the present invention from expression 
libraries, or for purification of lignin biosynthesis polypeptides. 

The isolated nucleic acids of the present invention can be used over a 
broad range of plant types, including species from the genera Cucurbita, Rosa, Vitis, 
Juglans, Fragaria, Lotus, Medicago, Onobrychis, Trifolium, Trigonella, Vigna, Citrus, 

10 Linum, Geranium, Manihot, Daucus, Arabidopsis, Brassica, Raphanus, Sinapis, 
Atropa, Capsicum, Datura, Hyoscyamus, Lycopersicon, Nicotiana, Solanum, Petunia, 
Digitalis, Majorana, Ciahoriwn, Helianthus, Lactuca, Bromus, Asparagus, 
Antirrhinum, Heterocallis, Nemesis, Pelargonium, Panieum, Pennisetum, Ranunculus, 
Senecio, Salpiglossis, Cucumis, Browaalia, Glycine, Pisum, Phaseolus, Lolium, Oryza, 

15 Zea, Avena, Hordeum, Secale, Triticum, Sorghum, Picea, and Populus. 
Nucleic Adds 

The present invention provides, inter alia, isolated nucleic acids of 
RNA, DNA, and analogs and/or chimeras thereof, comprising a lignin biosynthesis 
polynucleotide encoding such enzymes as: cinnamate-4-hydroxylase (C4H), 4- 

20 coumarate-3-hydroxylase (C3H), caffeic O-methyltransferase (C-OMT), ferulate-5- 
hydroxylase (F5H), caffeoyl-CoA 3-O-methyltransferase (CCoA-OMT), 4- 
coumarate:CoA ligase (4CL), cinnamoyl-CoA reductase (CCR), cinnamyl alcohol 
dehydrogenase (CAD), as well as diphenyl oxidase (DPO), a laccase involved in 
monomer polymerization. 

25 The lignin biosynthesis nucleic acids of the present invention comprise an 

isolated lignin biosynthesis polynucleotides which, are inclusive of: 

(a) a polynucleotide encoding a lignin biosynthesis polypeptide of SEQ 
ID NOS: 1-18 and 73-75 and conservatively modified and polymorphic variants 
thereof, including exemplary polynucleotides of SEQ ID NOS: 19-36 and 76-78; 

30 (b) a polynucleotide which is the product of amplification from a Zea 

mays nucleic acid library using primer pairs from amongst the consecutive pairs from 
SEQ ID NOS: 37-72 and 79-84, which amplify polynucleotides having substantial 
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identity to polynucleotides from amongst those having SEQ ID NOS: 19-36 and 76-78; 

(c) a polynucleotide which selectively hybridizes to a polynucleotide of 

(a) or (b); 

(d) a polynucleotide having at least 60% sequence identity with 
5 polynucleotides of (a), (b), or (c); 

(e) a polynucleotide encoding a protein having a specified number of 
contiguous amino acids from a prototype polypeptide, wherein the protein is 
specifically recognized by antisera elicited by presentation of the protein and wherein 
the protein does not detectably immunoreact to antisera which has been fully 

0 immunosorbed with the protein; 

(f) complementary sequences of polynucleotides of (a), (b), (c), (d), or 

(e); and 

(g) a polynucleotide comprising at least 15 contiguous nucleotides from a 
polynucleotide of (a), (b), (c), (d), (e), or (f). 

5 

A. Polynucleotides Encoding A Protein of SEQ ID NOS: 1-18 and 73-75 or 
Conservatively Modified or Polymorphic Variants Thereof 

As indicated in (a), supra, the present invention provides isolated 
0 heterologous nucleic acids comprising a lignin biosynthesis polynucleotide, wherein the 
polynucleotide encodes a lignin biosynthesis polypeptide, disclosed herein in SEQ ID 
NOS: 1-18 and 73-75, or conservatively modified or polymorphic variants thereof. 
Those of skill in the art will recognize that the degeneracy of the genetic code allows 
for a plurality of polynucleotides to encode for the identical amino acid sequence. Such 
5 "silent variations" can be used, for example, to selectively hybridize and detect allelic 
variants of polynucleotides of the present invention. Accordingly, the present invention 
includes polynucleotides of SEQ ID NOS: 19-36 and 76-78, and silent variations of 
polynucleotides encoding a polypeptide of SEQ ID NOS: 1-18 and 73-75. The present 
invention further provides isolated nucleic acids comprising polynucleotides encoding 
0 conservatively modified variants of a polypeptide of SEQ ID NOS: 1-18 and 73-75. 
Conservatively modified variants can be used to generate or select antibodies 
immunoreactive to the non-variant polypeptide. Additionally, the present invention 
further provides isolated nucleic acids comprising polynucleotides encoding one or 
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more polymorphic (allelic) variants of polypeptides/polynucleotides. Polymorphisms 
are frequently used to follow segregation of chromosomal regions in, for example, 
marker assisted selection methods for crop improvement. Exemplary polymorphisms 
are provided in Table I. 





SEO. ID NO.: 


2Q 






Position of Polymorphism 
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4-coumarate:CoA ligase is coded for by the polypeptides of SEQ ID 
NOS: 1, 2, and 3 which are encoded for by the nucleic acids of SEQ ID NOS:19, 20, 
and 21, respectively* 

10 Caffeic O-methyltransferase (C-OMT) is coded for by the polypeptides 

of SEQ ID NOS: 4, 5, 6, and 7 which are encoded for by the nucleic acids of SEQ ID 
NOS: 22, 23, 24, and 25, respectively. 

Cinnamate-4-hydroxylase (C4H) is coded for by the polypeptides of SEQ 
ID NOS: 8 and 9 which are encoded for by the nucleic acids of SEQ ID NOS: 26 and 
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27, respectively. 

Cinnamyl alcohol dehydrogenase (CAD) is coded for by the polypeptides 
of SEQ ID NOS: 10, 1 1 and 12 which are encoded for by the nucleic acids of SEQ ID 
NOS: 28, 29, and 30, respectively. 
5 Caffeoyl-CoA 3-O-methyltransferase (CCoA-OMT) is coded for by the 

polypeptides of SEQ ID NOS: 13, 14, 15, and 74 which are encoded for by the nucleic 
adds of SEQ ID NOS: 31, 32, 33, and 77, respectively. 

Cinnamoyl-CoA reductase (CCR) is coded for by the polypeptides of 
SEQ ID NO: 34 which is encoded for by the nucleic acid of SEQ ID NO: 16. 
10 A partial sequence for ferulate-5-hydroxylase (F5H) is coded for by the 

polypeptide of SEQ ID NO: 35 which is encoded for by the nucleic acid of SEQ ID 
NO: 17. 

A partial sequence for diphenyl oxidase (DPO) is coded for by the 
polypeptides of SEQ ID NO: 36 which is encoded for by the nucleic acid of SEQ ID 
15 NO:18. 

Ferulate-5-hydroxylase (F5H) is coded for by the polypeptide of SEQ ID 
NO: 73 which is encoded for by the nucleic acid of SEQ ID NO: 76. 

Diphenyl oxidase (DPO) is coded for by the polypeptide of SEQ ID NO: 
75 which is encoded for by the nucleic acid of SEQ ID NO:78. 

20 

B. Polynucleotides Amplified from a Zea mays Nucleic Acid library 
As indicated in (b), supra, the present invention provides isolated nucleic acids 
comprising lignin biosynthesis polynucleotides, wherein the polynucleotides are 
amplified from a Zea mays nucleic acid library. Zea mays lines B73, PHRE1, A632, 

25 BMS-P2#10, W23, and Mol7 are known and publicly available. Other publicly 
known and available maize lines can be obtained from the Maize Genetics Cooperation 
(Urbana, IL). The nucleic acid library may be a cDNA library, a genomic library, or a 
library generally constructed from nuclear transcripts at any stage of intron processing. 
Generally, a cDNA nucleic acid library will be constructed to comprise a majority of 

30 full-length cDNAs. Often, cDNA libraries will be normalized to increase the 
representation of relatively rare cDNAs. In preferred embodiments, the cDNA library 
is constructed mature lignified tissue such as root, leaf, or tassel tissue. The cDNA 
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library can be constructed using a full-length cDNA synthesis method. Examples of such 
methods include Oligo-Capping (Maruyama, K. and Sugano, S. Gene 138: 171-174, 
1994), Biotinylated CAP Trapper (Carninci, P., Kvan, C, et al Genomics 37: 327-336, 
1996), and CAP Retention Procedure (Edery, E., Chu, LX., et al. Molecular and Cellular 
5 Biology 15: 3363-3371, 1995). cDNA synthesis is preferably catalyzed at 50-55°C to 
prevent formation of RNA secondary structure. Examples of reverse transcriptases that 
are relatively stable at these temperatures are Superscript H Reverse Transcriptase (Life 
Technologies, Inc.), AMV Reverse Transcriptase (Boehringer Mannheim) and RetroAmp 
Reverse Transcriptase (Epicentre). Rapidly growing tissues, or rapidly dividing cells are 
10 preferably used as mRNA sources. 

The polynucleotides of the present invention include those amplified 
using the following primer pairs: 

SEQ ID NOS: 37 and 38 which yield an amplicon comprising a sequence having 
substantial identity to SEQ ID NO: 19; 
15 SEQ ID NOS: 39 and 40 which yield an amplicon comprising a sequence having 
substantial identity to SEQ ID NO:20; 

SEQ ID NOS: 41 and 42 which yield an amplicon comprising a sequence having 
substantial identity to SEQ ID NO:21; 

SEQ ID NOS: 43 and 44 which yield an amplicon comprising a sequence having 
20 substantial identity to SEQ ID NO:22; 

SEQ ID NOS: 45 and 46 which yield an amplicon comprising a sequence having 
substantial identity to SEQ ID NO:23; 

SEQ ID NOS: 47 and 48 which yield an amplicon comprising a sequence having 
substantial identity to SEQ ID NO:24; 
25 SEQ ID NOS: 49 and 50 which yield an amplicon comprising a sequence having 
substantial identity to SEQ ID NO:25; 

SEQ ID NOS: 51 and 52 which yield an amplicon comprising a sequence having 
substantial identity to SEQ ID NO:26; 

SEQ ID NOS: 53 and 54 which yield an amplicon comprising a sequence having 
30 substantial identity to SEQ ID NO:27; 

SEQ ID NOS: 55 and 56 which yield an amplicon comprising a sequence having 
substantial identity to SEQ ID NO:28; 
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SEQ ID NOS: 57 and 58 which yield an amplicon comprising a sequence having 
substantial identity to SEQ ID NO:29; 

SEQ ID NOS: 59 and 60 which yield an amplicon comprising a sequence having 
substantial identity to SEQ ID NO:30; 
5 SEQ ID NOS: 61 and 62 which yield an amplicon comprising a sequence having 
substantial identity to SEQ ID NO:31; 

SEQ ID NOS: 63 and 64 which yield an amplicon comprising a sequence having 
substantial identity to SEQ ID NO:32; 

SEQ ID NOS: 65 and 66 which yield an amplicon comprising a sequence having 
10 substantial identity to SEQ ID NO:33; 

SEQ ID NOS: 67 and 68 which yield an amplicon comprising a sequence having 
substantial identity to SEQ ID NO:34; 

SEQ ID NOS: 69 and 70 which yield an amplicon comprising a sequence having 
substantial identity to SEQ ID NO:35; 
15 SEQ ID NOS: 71 and 72 which yield an amplicon comprising a sequence having 
substantial identity to SEQ ID NO:36. 

SEQ ID NOS: 79 and 80 which yield an amplicon comprising a sequence having 
substantial identity to SEQ ID NO:76. 

SEQ ID NOS: 81 and 82 which yield an amplicon comprising a sequence having 

20 substantial identity to SEQ ID NO:77. 

SEQ ID NOS: 83 and 84 which yield an amplicon comprising a sequence having 
substantial identity to SEQ ID NO:78. 

The present invention also provides subsequences of full-length nucleic 
acids. Any number of subsequences can be obtained by reference to SEQ ID NOS: 19- 

25 36 and 76-78, and using primers which selectively amplify, under stringent conditions 
to: at least two sites to the polynucleotides of the present invention, or to two sites 
within the nucleic acid which flank and comprise a polynucleotide of the present 
invention, or to a site within a polynucleotide of the present invention and a site within 
the nucleic acid which comprises it. A variety of methods for obtaining 5* and/or 3' 

30 ends is well known in the art. See, e.g., RACE (Rapid Amplification of 
Complementary Ends) as described in Frohman, M. A., in PCR Protocols: A Guide to 
Methods and Applications, M. A. Innis, D. H. Gelfand, J. J. Sninsky, T. J. White, Eds. 
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(Academic Press, Inc., San Diego, 1990), pp. 28-38.); see also, U.S. Pat. No. 5,470,722, 
and Current Protocols in Molecular Biology, Unit 15.6, Ausubel, ex aJ. y Eds., Greene 
Publishing and Wiley-Interscience, New York (1995). Thus, the present invention 
provides lignin biosynthesis polynucleotides having the sequence of the lignin 
5 biosynthesis gene, nuclear transcript, cDNA, or complementary sequences and/or 
subsequences thereof. 

Primer sequences can be obtained by reference to a contiguous 
subsequence of a polynucleotide of the present invention. Primers are chosen to 
selectively hybridize, under PCR amplification conditions, to a polynucleotide of the 

10 present invention in an amplification mixture comprising a genomic and/or cDNA 
library from the same species. Generally, the primers are complementary to a 
subsequence of the amplicon they yield. In some embodiments, the primers will be 
constructed to anneal at their 5* terminal end's to the codon encoding the carboxy or 
amino terminal amino acid residue (or the complements thereof) of the polynucleotides 

15 of the present invention. The primer length in nucleotides is selected from the group of 
integers consisting of from at least 15 to 50. Thus, the primers can be at least 15, 18, 
20, 25, 30, 40, or 50 nucleotides in length. A non-annealing sequence at the 5'end of 
the primer (a "tail") can be added, for example, to introduce a cloning site at the 
terminal ends of the amplicon. 

20 The amplification primers may optionally be elongated in the 3* direction 

with additional contiguous nucleotides from the polynucleotide sequences, such as SEQ 
ID NOS: 19-36 and 76-78, from which they are derived. The number of nucleotides 
by which the primers can be elongated is selected from the group of integers consisting 
of from at least 1 to 25. Thus, for example, the primers can be elongated with an 

25 additional 1, 5, 10, or 15 nucleotides. Those of skill will recognize that a lengthened 
primer sequence can be employed to increase specificity of binding (i.e., annealing) to 
a target sequence. 

The amplification products can be translated using expression systems 
well known to those of skill in the art and as discussed, infra. The resulting translation 
30 products can be confirmed as polypeptides of the present invention by, for example, 
assaying for the appropriate catalytic activity (e.g., specific activity and/or substrate 
specificity), or verifying the presence of one or more linear epitopes which are specific 
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to a polypeptide of the present invention. Methods for protein synthesis from PCR 
derived templates are known in the art and available commercially. See, e.g., 
Amersham Life Sciences, Inc, Catalog '97, p.354. 

5 C. Polynucleotides Which Selectively Hybridize to a Polynucleotide of (A) or (B) 

As indicated in (c), supra, the present invention provides isolated nucleic 
acids comprising lignin biosynthesis polynucleotides, wherein the polynucleotides 
selectively hybridize, under selective hybridization conditions, to a polynucleotide of 
paragraphs (A) or (B) as discussed, supra. Thus, the polynucleotides of this 

10 embodiment can be used for isolating, detecting, and/or quantifying nucleic acids 
comprising the polynucleotides of (A) or (B). For example, polynucleotides of the 
present invention can be used to identify, isolate, or amplify partial or full-length clones 
in a deposited library. In some embodiments, the polynucleotides are genomic or 
cDNA sequences isolated from a Zea mays nucleic acid library. Preferably, the cDNA 

15 library comprises at least 80% full-length sequences, preferably at least 85% or 90% 
full-length sequences, and more preferably at least 95% full-length sequences. The 
cDNA libraries can be normalized to increase the representation of rare sequences. 
Low stringency hybridization conditions are typically, but not exclusively, employed 
with sequences having a reduced sequence identity relative to complementary 

20 sequences. Moderate and high stringency conditions can optionally be employed for 
sequences of greater identity. Low stringency conditions allow selective hybridization 
of sequences having about 70% sequence identity and can be employed to identify 
orthologous or paralogous sequences. 

25 D. Polynucleotides Having at Least 60% Sequence Identity with the Polynucleotides 
of(A),(B)or(Q 

As indicated in (d), supra, the present invention provides isolated nucleic 
acids comprising lignin biosynthesis polynucleotides, wherein the polynucleotides have 
30 a specified identity at the nucleotide level to a polynucleotide as disclosed above in 
paragraphs (A), (B), or (C). The percentage of identity to a reference sequence is at 
least 60% and, rounded upwards to the nearest integer, can be expressed as an integer 
selected from the group of integers consisting of from 60 to 99. Thus, for example, the 
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percentage of identity to a reference sequence can be at least 70%, 75%, 80%, 85%, 
90%, or 95%. 

Optionally, the polynucleotides of this embodiment will share an epitope 
with a polypeptide encoded by the polynucleotides of (A), (B), or (C). Thus, these 
5 polynucleotides encode a first polypeptide which elicits production of antisera 
comprising antibodies which are specifically reactive to a second polypeptide encoded 
by a polynucleotide of (A), (B), or (C). However, the first polypeptide does not bind 
to antisera raised against itself when the antisera has been fully immunosorbed with the 
first polypeptide. Hence, the polynucleotides of this embodiment can be used to 

10 generate antibodies for use in, for example, the screening of expression libraries for 
nucleic acids comprising polynucleotides of (A), (B), or (C), or for purification of, or 
in immunoassays for, polypeptides encoded by the polynucleotides of (A), (B), or (C). 
The polynucleotides of this embodiment embrace nucleic acid sequences which can be 
employed for selective hybridization to a polynucleotide encoding a polypeptide of the 

15 present invention. 

Screening polypeptides for specific binding to antisera can be 
conveniently achieved using peptide display libraries. This method involves the 
screening of large collections of peptides for individual members having the desired 
function or structure. Antibody screening of peptide display libraries is well known in 

20 the art. The displayed peptide sequences can be from 3 to 5000 or more amino acids in 
length, frequently from 5-100 amino acids long, and often from about 8 to 15 amino 
acids long. In addition to direct chemical synthetic methods for generating peptide 
libraries, several recombinant DNA methods have been described. One type involves 
the display of a peptide sequence on the surface of a bacteriophage or cell. Each 

25 bacteriophage or cell contains the nucleotide sequence encoding the particular displayed 
peptide sequence. Such methods are described in PCT patent publication Nos. 
91/17271, 91/18980, 91/19818, and 93/08278. Other systems for generating libraries 
of peptides have aspects of both in vitro chemical synthesis and recombinant methods. 
See, PCT Patent publication Nos. 92/05258, 92/14843, and 96/19256. See also, U.S. 

30 Patent Nos. 5,658,754; and 5,643,768. Peptide display libraries, vectors, and 
screening kits are commercially available from such suppliers as Invitrogen (Carlsbad, 
CA). 
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E. Polynucleotides Encoding a Protein Having a Subsequence from a Prototype 
Polypeptide and is Cross-Reactive to the Prototype Polypeptide 

5 As indicated in (e), supra, the present invention provides isolated nucleic 

acids comprising lignin biosynthesis polynucleotides, wherein the polynucleotides 
encode a protein having a subsequence of contiguous amino acids from a prototype 
lignin biosynthesis polypeptide. Exemplary prototype lignin biosynthesis polypeptides 
are provided in SEQ ID NOS: 1-18 and 73-75. The length of contiguous amino acids 

10 from the prototype polypeptide is selected from the group of integers consisting of from 
at least 10 to the number of amino acids within the prototype sequence. Thus, for 
example, the polynucleotide can encode a polypeptide having a subsequence having at 
least 10, 15, 20, 25, 30, 35, 40, 45, or 50, contiguous amino acids from the prototype 
polypeptide. Further, the number of such subsequences encoded by a polynucleotide of 

15 the instant embodiment can be any integer selected from the group consisting of from 1 
to 20, such as 2, 3, 4, or 5. The subsequences can be separated by any integer of 
nucleotides from 1 to the number of nucleotides in the sequence such as at least 5, 10, 
15, 25, 50, 100, or 200 nucleotides. 

The proteins encoded by polynucleotides of this embodiment, when 

20 presented as an immunogen, elicit the production of polyclonal antibodies which 
specifically bind to a prototype polypeptide such as, but not limited to, a polypeptide 
encoded by the polynucleotide of (b), supra, or exemplary polypeptides of SEQ ID 
NOS: 1-18 and 73-75. Generally, however, a protein encoded by a polynucleotide of 
this embodiment does not bind to antisera raised against the prototype polypeptide when 

25 the antisera has been fully immunosorbed with the prototype polypeptide. Methods of 
making and assaying for antibody binding specificity/affinity are well known in the art 
Exemplary immunoassay formats include ELISA, competitive immunoassays, 
radioimmunoassays, Western blots, indirect immunofiuorescent assays and the like. 

In a preferred assay method, fully immunosorbed and pooled antisera 

30 which is elicited to the prototype polypeptide can be used in a competitive binding assay 
to test the protein. The concentration of the prototype polypeptide required to inhibit 
50% of the binding of the antisera to the prototype polypeptide is determined. If the 
amount of the protein required to inhibit binding is less than twice the amount of the 
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prototype protein, then the protein is said to specifically bind to the antisera elicited to 
the immunogen. Accordingly, the proteins of the present invention embrace allelic 
variants, conservatively modified variants, and minor recombinant modifications to a 
prototype polypeptide. 

5 A polynucleotide of the present invention optionally encodes a protein 

having a molecular weight as the non-glycosylated protein within 20% of the molecular 
weight of the full-length non-glycosylated lignin biosynthesis polypeptides as disclosed 
herein (e.g., SEQ ID NOS:l-18 and 73-75). Molecular weight can be readily 
determined by SDS-PAGE under reducing conditions. Preferably, the molecular 

10 weight is within 15% of a full length lignin biosynthesis polypeptide, more preferably 
within 10% or 5%, and most preferably within 3%, 2%, or 1% of a full length lignin 
biosynthesis polypeptide of the present invention. Molecular weight determination of a 
protein can be conveniently performed by SDS-PAGE under denaturing conditions. 

Optionally, the polynucleotides of this embodiment will encode a protein 

15 having a specific activity at least 20%, 30%, 40%, or 50% of the native, endogenous 
(i.e., non-isolated), full-length lignin biosynthesis polypeptide. Further, the proteins 
encoded by polynucleotides of this embodiment will optionally have a substantially 
similar apparent dissociation constant (K^ ) and/or catalytic activity (i.e., the 
microscopic rate constant, kj) as the native endogenous, full-length lignin biosynthesis 

20 protein. Those of skill in the art will recognize that KJK> value determines the 
specificity for competing substrates and is often referred to as the specificity constant. 
Proteins of this embodiment can have a KJJ^ value at least 10% of the non-isolated 
full-length lignin biosynthesis polypeptide as determined using the substrate of that 
polypeptide from the lignin biosynthesis specific pathways, supra. Optionally, the 

25 KJK^ value will be at least 20%, 30%, 40%, 50%, and most preferably at least 60%, 
70%, 80%, 90%, or 95% the KJK* value of the non-isolated, full-length lignin 
biosynthesis polypeptide. Determination of k^, K„ , and KJK^ can be determined by 
any number of means well known to those of skill in the art. For example, the initial 
rates (i.e., the first 5% or less of the reaction) can be determined using rapid mixing 

30 and sampling techniques (e.g., continuous-flow, stopped-flow, or rapid quenching 
techniques), flash photolysis, or relaxation methods (e.g., temperature jumps) in 
conjunction with such exemplary methods of measuring as spectrophotometry, 
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spectrofluorimetry , nuclear magnetic resonance, or radioactive procedures. Kinetic 
values are conveniently obtained using a Lineweaver-Burk or Eadie-Hof stee plot. 

F. Polynucleotides Complementary to the Polynucleotides of(A)-(E) 

5 As indicated in (f), supra, the present invention provides isolated nucleic 

acids comprising lignin biosynthesis polynucleotides, wherein the polynucleotides are 
complementary to the polynucleotides of paragraphs A-E, above. As those of skill in 
the art will recognize, complementary sequences base-pair throughout the entirety of 
their length with the polynucleotides of (A)-(E) (i.e., have 100% sequence identity over 
10 their entire length). Complementary bases associate through hydrogen bonding in 
double stranded nucleic acids. For example, the following base pairs are 
complementary: guanine and cytosine; adenine and thymine; and adenine and uracil. 

G. Polynucleotides Which are Subsequences of the Polynucleotides of(A)-(F) 

15 As indicated in (g), supra, the present invention provides isolated nucleic 

acids comprising lignin biosynthesis polynucleotides, wherein the polynucleotide 
comprises at least 15 contiguous bases from the polynucleotides of (A) through (F) as 
discussed above. The length of the polynucleotide is given as an integer selected from 
the group consisting of from at least 15 to the length of the nucleic acid sequence from 

20 which the polynucleotide is a subsequence of. Thus, for example, polynucleotides of 
the present invention are inclusive of polynucleotides comprising at least 15, 20, 25, 
30, 40, 50, 60, 75, or 100 contiguous nucleotides in length from the polynucleotides of 
(A)-(F). Optionally, the number of such subsequences encoded by a polynucleotide of 
the instant embodiment can be any integer selected from the group consisting of from 1 

25 to 20, such as 2, 3, 4, or 5. The subsequences can be separated by any integer of 
nucleotides from 1 to the number of nucleotides in the sequence such as at least 5, 10, 
15, 25, 50, 100, or 200 nucleotides. 

The subsequences of the present invention can comprise structural 
characteristics of the sequence from which it is derived. Alternatively, the subsequences 

30 can lack certain structural characteristics of the larger sequence from which it is derived. 
For example, a subsequence from a polynucleotide encoding a polypeptide having at least 
one linear epitope in common with a prototype sequence, such as SEQ ID NOS: 1-18 and 
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10 



73-75, may encode an epitope in common with the prototype sequence. Alternatively, the 
subsequence may not encode an epitope in common with the prototype sequence but can 
be used to isolate the larger sequence by, for example, nucleic acid hybridization with the 
sequence from which it's derived. Subsequences can be used to modulate or detect gene 
expression by introducing into the subsequences compounds which bind, intercalate, 
cleave and/or crosslink to nucleic acids. Exemplary compounds include acridine, psoralen, 
phenanthroline, naphthoquinone, daunomycin or chloroethylaminoaryl conjugates. 

Construction of Nucleic Acids 

The isolated nucleic acids of the present invention can be made using (a) 
standard recombinant methods, (b) synthetic techniques, or combinations thereof. In 
some embodiments, the polynucleotides of the present invention will be cloned, 
amplified, or otherwise constructed from a monocot. In preferred embodiments the 
monocot is Zea mays. Particularly preferred is the use of Zea mays tissue from root, 
15 leaf, or tassel. 

The nucleic acids may conveniently comprise sequences in addition to a 
polynucleotide of the present invention. For example, a multi-cloning site comprising 
one or more endonuclease restriction sites may be inserted into the nucleic acid to aid in 
isolation of the polynucleotide. Also, translatable sequences may be inserted to aid in 

20 the isolation of the translated polynucleotide of the present invention. For example, a 
hexa-histidine marker sequence provides a convenient means to purify the proteins of 
the present invention. The nucleic acid of the present invention - excluding the 
polynucleotide sequence - is optionally a vector, adapter, or linker for cloning and/or 
expression of a polynucleotide of the present invention. Additional sequences may be 

25 added to such cloning and/or expression sequences to optimize their function in cloning 
and/or expression, to aid in isolation of the polynucleotide, or to improve the 
introduction of the polynucleotide into a cell. Typically, the length of a nucleic acid of 
the present invention less the length of its polynucleotide of the present invention is less 
than 20 Mobase pairs, often less than 15 kb, and frequently less than 10 kb. Use of 

30 cloning vectors, expression vectors, adaptors, and linkers is well known in the art. 
Exemplary nucleic acids include such vectors as: M13, lambda ZAP Express, lambda 
ZAP n, lambda gtlO, lambda gtll, pBK-CMV, pBK-RSV, pBluescript II, lambda 
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DASH H, lambda EMBL 3, lambda EMBL 4, pWE15, SuperCos 1, SurfZap, Uni- 
ZAP, pBC, pBS+/-, pSG5, pBK, pCR-Script, pET, pSPUTK, p3*SS, pOPRSVI CAT, 
pOPI3 CAT, pXTl, pSG5, pPbac, pMbac, pMClneo, pOG44, pOG45, pFRT0GAL, 
pNEOPGAL, pRS403, pRS404, pRS405, pRS406, pRS413, pRS414, pRS415, 
5 pRS416, lambda MOSSlox, and lambda MOSElox. For a description of various 
nucleic acids see, for example, Stratagene Cloning Systems, Catalogs 1995, 1996, 1997 
(La Jolla, CA); and, Amersham life Sciences, Inc, Catalog *97 (Arlington Heights, 
IL). 

10 A. Recombinant Methods for Constructing Nucleic Acids 

The isolated nucleic acid compositions of this invention, such as RNA, 
cDNA, genomic DNA, or a hybrid thereof, can be obtained from plant biological 
sources using any number of cloning methodologies known to those of skill in the art. 
In some embodiments, oligonucleotide probes which selectively hybridize, under 

15 stringent conditions, to the polynucleotides of the present invention are used to identify 
the desired sequence in a cDNA or genomic DNA library. While isolation of RNA, 
and construction of cDNA and genomic libraries is well known to those of ordinary 
skill in the art, the following highlights some of the methods employed. 

20 Al. mRNA Isolation and Purification 

Total RNA from plant cells comprises such nucleic acids as 
mitochondrial RNA, chloroplastic RNA, rRNA, tRNA, hnRNA and mRNA. Total 
RNA preparation typically involves lysis of cells and removal of proteins, followed by 
precipitation of nucleic acids. Extraction of total RNA from plant cells can be 

25 accomplished by a variety of means. Frequently, extraction buffers include a strong 
detergent such as SDS and an organic denaturant such as guanidinium isothiocyanate, 
guanidine hydrochloride or phenol. Following total RNA isolation, poly(A) + mRNA is 
typically purified from the remainder RNA using oligo(dT) cellulose. Exemplary total 
RNA and mRNA isolation protocols are described in Plant Molecular Biology: A 

30 Laboratory Manual, Clark, Ed., Springer- Verlag, Berlin (1997); and, Current 
Protocols in Molecular Biology, Ausubel, et al., Eds., Greene Publishing and Wiley- 
Interscience, New York (1995). Total RNA and mRNA isolation kits are commercially 
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available from vendors such as Stratagene (La Jolla, CA), Clonetech (Palo Alto, CA), 
Pharmacia (Piscataway, NJ), and 5'-3* (Paoli, PA). See also, U.S. Patent Nos. 
5,614,391; and, 5,459,253. The mRNA can be fractionated into populations with size 
ranges of about 0.5, 1.0, 1.5, 2.0, 2.5 or 3.0 kb. The cDNA synthesized for each of 
5 these fractions can be size selected to the same size range as its mRNA prior to vector 
insertion. This method helps eliminate truncated cDNA formed by incompletely 
reverse transcribed mRNA. 

A2. Construction of a cDNA Library 
10 Construction of a cDNA library generally entails five steps. First, first 

strand cDNA synthesis is initiated from a poly(A) + mRNA template using a poly(dT) 
primer or random hexanucleotides. Second, the resultant RNA-DNA hybrid is 
converted into double stranded cDNA, typically by a combination of RNAse H and 
DNA polymerase I (or Klenow fragment). Third, the termini of the double stranded 
15 cDNA are ligated to adaptors. Ligation of the adaptors will produce cohesive ends for 
cloning. Fourth, size selection of the double stranded cDNA eliminates excess adaptors 
and primer fragments, and eliminates partial cDNA molecules due to degradation of 
mRNAs or the failure of reverse transcriptase to synthesize complete first strands. 
Fifth, the cDNAs are ligated into cloning vectors and packaged. cDNA synthesis 
protocols are well known to the skilled artisan and are described in such standard 
references as: Plant Molecular Biology: A Laboratory Manual, Clark, Ed., Springer- 
Veriag, Berlin (1997); and, Current Protocols in Molecular Biology, Ausubel, et al., 
Eds., Greene Publishing and Wiley-Interscience, New York (1995). cDNA synthesis 
kits are available from a variety of commercial vendors such as: Stratagene, and 
25 Pharmacia. 

A number of cDNA synthesis protocols have been described which 
provide substantially pure full-length cDNA libraries. Substantially pure full-length 
cDNA libraries are constructed to comprise at least 90%, and more preferably at least 
93% or 95% full-length inserts amongst clones containing inserts. The length of insert 
in such libraries can be from 0 to 8, 9, 10, 11, 12, 13, or more kilobase pairs. Vectors 
to accommodate inserts of these sizes are known in the art and available commercially. 
See, e.g., Stratagene's lambda ZAP Express (cDNA cloning vector with 0 to 12 kb 
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cloning capacity). 

An exemplary method of constructing a greater than 95% pure full- 
length cDNA library is described by Carninci et al., Genomics, 37:327-336 (1996). In 
that protocol, the cap-structure of eukaryotic mRNA is chemically labeled with biotin. 
5 By using streptavidin-coated magnetic beads, only the full-length first-strand 
cDNA/mRNA hybrids are selectively recovered after RNase I treatment. The method 
provides a high yield library with an unbiased representation of the starting mRNA 
population. Other methods for producing full-length libraries are known in the art. 
See, e.g., Edery etal., Mol. Ce//flw/.,15(6):3363-3371 (1995); and, PCT Application 
10 WO 96/34981. 

A3. Normalized or Subtracted cDNA Libraries 

A non-normalized cDNA library represents the mRNA population of the 
tissue it was made from. Since unique clones are out-numbered by clones derived from 
15 highly expressed genes their isolation can be laborious. Normalization of a cDNA 
library is the process of creating a library in which each clone is more equally 
represented. 

A number of approaches to normalize cDNA libraries are known in the 
art. One approach is based on hybridization to genomic DNA. The frequency of each 

20 hybridized cDNA in the resulting normalized library would be proportional to that of 
each corresponding gene in the genomic DNA. Another approach is based on kinetics. 
If cDNA reannealing follows second-order kinetics, rarer species anneal less rapidly 
and the remaining single-stranded fraction of cDNA becomes progressively more 
normalized during the course of the hybridization. Specific loss of any species of 

25 cDNA, regardless of its abundance, does not occur at any Cot value. Construction of 
normalized libraries is described in Ko, Nucl. Acids. Res., 18(19):5705-5711 (1990); 
Patanjali et al., Proc. Natl Acad. U.S.A., 88:1943-1947 (1991); U.S. Patents 
5,482,685, and 5,637,685. In an exemplary method described by Soares et al., 
normalization resulted in reduction of the abundance of clones from a range of four 

30 orders of magnitude to a narrow range of only 1 order of magnitude. Proc. Natl. Acad. 
Sci. USA, 91:9228-9232 (1994). 
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Subtracted cDNA libraries are another means to increase the proportion 
of less abundant cDNA species. In this procedure, cDNA prepared from one pool of 
mRNA is depleted of sequences present in a second pool of mRNA by hybridization. 
The cDNArmRNA hybrids are removed and the remaining un-hybridized cDNA pool is 
5 enriched for sequences unique to that pool. See, Foote et al in, Plant Molecular 
Biology: A Laboratory Manual, Clark, Ed., Springer- Verlag, Berlin (1997); Kho and 
Zarbl, Technique, 3(2):58-63 (1991); Sive and St. John, Nucl. Acids Res., 
16(22): 10937 (1988); Current Protocols in Molecular Biology, Ausubel, et al., Eds., 
Greene Publishing and Wiley-Interscience, New York (1995); and, Swaroop et al., 
10 Nucl. Adds Res., 19)8): 1954 (1991). cDNA subtraction kits are commercially 
available. See, e.g., PCR-Select (Clontech). 

A4. Construction of a Genomic Library 

To construct genomic libraries, large segments of genomic DNA are 

15 generated by random fragmentation, e.g. using restriction endonucleases, and are 
ligated with vector DNA to form concatemers that can be packaged into the appropriate 
vector. Methodologies to accomplish these ends, and sequencing methods to verify the 
sequence of nucleic acids are well known in the art. Examples of appropriate molecular 
biological techniques and instructions sufficient to direct persons of skill through many 

20 construction, cloning, and screening methodologies are found in Sambrook, et al., 
Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory 
Vols. 1-3 (1989), Methods in Enzymology, Vol. 152: Guide to Molecular Cloning 
Techniques, Berger and Kimmel, Eds., San Diego: Academic Press, Inc. (1987), 
Current Protocols in Molecular Biology, Ausubel, et al., Eds., Greene Publishing and 

25 Wiley-Interscience, New York (1995); Plant Molecular Biology: A Laboratory Manual, 
Clark, Ed., Springer-Verlag, Berlin (1997). Kits for construction of genomic libraries 
are also commercially available. 

AS. Nucleic Acid Screening and Isolation Methods 
30 The cDNA or genomic library can be screened using a probe based upon 

the sequence of a polynucleotide of the present invention such as those disclosed herein. 
Probes may be used to hybridize with genomic DNA or cDNA sequences to isolate 
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homologous genes in the same or different plant species. Those of skill in the art will 
appreciate that various degrees of stringency of hybridization can be employed in the 
assay; and either the hybridization or the wash medium can be stringent. As the 
conditions for hybridization become more stringent, there must be a greater degree of 
5 complementarity between the probe and the target for duplex formation to occur. The 
degree of stringency can be controlled by temperature, ionic strength, pH and the 
presence of a partially denaturing solvent such as formamide. For example, the 
stringency of hybridization is conveniently varied by changing the polarity of the 
reactant solution through manipulation of the concentration of formamide within the 

10 range of 0% to 50%. The degree of complementarity (sequence identity) required for 
detectable binding will vary in accordance with the stringency of the hybridization 
medium and/or wash medium. The degree of complementarity will optimally be 100 
percent; however, it should be understood that minor sequence variations in the probes 
and primers may be compensated for by reducing the stringency of the hybridization 

IS and/or wash medium. 

The nucleic acids of interest can also be amplified from nucleic acid 
samples using amplification techniques. For instance, polymerase chain reaction (PCR) 
technology can be used to amplify the sequences of polynucleotides of the present 
invention and related genes directly from genomic DNA or cDNA libraries. PCR and 

20 other in vitro amplification methods may also be useful, for example, to clone nucleic 
acid sequences that code for proteins to be expressed, to make nucleic acids to use as 
probes for detecting the presence of the desired mRNA in samples, for nucleic acid 
sequencing, or for other purposes. Examples of techniques sufficient to direct persons 
of skill through in vitro amplification methods are found in Berger, Sambrook, and 

25 Ausubel, as well as Mullis et aL, U.S. Patent No. 4,683,202 (1987); and, PCR 
Protocols A Guide to Methods and Applications, Innis et al., Eds., Academic Press 
Inc., San Diego, CA (1990). Commercially available kits for genomic PCR 
amplification are known in the art. See, e.g., Advantage-GC Genomic PCR Kit 
(Clontech). The T4 gene 32 protein (Boehringer Mannheim) can be used to improve 

30 yield of long PCR products. 

PCR-based screening methods have also been described. Wilfinger et al. 
describe a PCR-based method in which the longest cDNA is identified in the first step 
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so that incomplete clones can be eliminated from study. BioTechniques, 22(3): 481-486 
(1997). In that method, a primer pair is synthesized with one primer annealing to the 
5* end of the sense strand of the desired cDNA and the other primer to the vector. 
Clones are pooled to allow large-scale screening. By this procedure, the longest 
5 possible clone is identified amongst candidate clones. Further, the PCR product is used 
solely as a diagnostic for the presence of the desired cDNA and does not utilize the 
PCR product itself. Such methods are particularly effective in combination with a full- 
length cDNA construction methodology, supra. 

10 B. Synthetic Methods for Constructing Nucleic Acids 

The isolated nucleic acids of the present invention can also be prepared 
by direct chemical synthesis by methods such as the phosphotriester method of Narang 
et al., Meth. Emymol. 68: 90-99 (1979); the phosphodiester method of Brown et al., 
Meth. Enzymol. 68: 109-151 (1979); the diethylphosphoramidite method of Beaucage et 

15 al., Tetra. Lett. 22: 1859-1862 (1981); the solid phase phosphoramidite triester method 
described by Beaucage and Caruthers, Tetra. Letts. 22(20): 1859-1862 (1981), e.g., 
using an automated synthesizer, e.g., as described in Needham-VanDevanter et al., 
Nucleic Acids Res., 12: 6159-6168 (1984); and, the solid support method of U.S. 
Patent No. 4,458,066. Chemical synthesis generally produces a single stranded 

20 oligonucleotide. This may be converted into double stranded DNA by hybridization 
with a complementary sequence, or by polymerization with a DNA polymerase using 
the single strand as a template. One of skill will recognize that while chemical 
synthesis of DNA is limited to sequences of about 100 bases, longer sequences may be 
obtained by the ligation of shorter sequences. 

25 

Recombinant Expression Cassettes 

The present invention further provides recombinant expression cassettes 
comprising a nucleic acid of the present invention. A nucleic acid sequence coding for 
the desired polynucleotide of the present invention, for example a cDNA or a genomic 
30 sequence encoding a full length polypeptide of the present invention, can be used to 
construct a recombinant expression cassette which can be introduced into the desired 
host cell. A recombinant expression cassette will typically comprise a polynucleotide of 
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the present invention operably linked to transcriptional initiation regulatory sequences 
which will direct the transcription of the polynucleotide in the intended host cell, such 
as tissues of a transformed plant. 

For example, plant expression vectors may include (1) a cloned plant 
5 gene under the transcriptional control of 5' and 3 1 regulatory sequences and (2) a 

dominant selectable marker. Such plant expression vectors may also contain, if desired, 
a promoter regulatory region (e.g., one conferring inducible or constitutive, 
environmentally- or developmentally-regulated, or cell- or tissue-specific/selective 
expression), a transcription initiation start site, a ribosome binding site, an RNA 

10 processing signal, a transcription termination site, and/or a polyadenylation signal. 

A plant promoter fragment can be employed which will direct expression 
of a polynucleotide of the present invention in all tissues of a regenerated plant. Such 
promoters are referred to herein as "constitutive" promoters and are active under most 
environmental conditions and states of development or cell differentiation. Examples of 

15 constitutive promoters include the cauliflower mosaic virus (CaMV) 35S transcription 
initiation region, the T- or 2'- promoter derived from T-DNA of Agrobacterium 
twnefaciens, the ubiquitin 1 promoter, the Smas promoter, the cinnamyl alcohol 
dehydrogenase promoter (U.S. Patent No. 5,683,439), the Nos promoter, the pEmu 
promoter, the rubisco promoter, the GRP1-8 promoter, and other transcription initiation 

20 regions from various plant genes known to those of skill. 

Alternatively, the plant promoter can direct expression of a 
polynucleotide of the present invention in a specific tissue or may be otherwise under 
more precise environmental or developmental control. Such promoters are referred to 
here as "inducible" promoters. Environmental conditions that may effect transcription 

25 by inducible promoters include pathogen attack, anaerobic conditions, or the presence 
of light. Examples of inducible promoters are the Adhl promoter which is inducible by 
hypoxia or cold stress, the Hsp70 promoter which is inducible by heat stress, and the 
PPDK promoter which is inducible by light. 

Examples of promoters under developmental control include promoters 

30 that initiate transcription only, or preferentially, in certain tissues, such as leaves, roots, 
fruit, seeds, or flowers. The operation of a promoter may also vary depending on its 
location in the genome. Thus, an inducible promoter may become fully or partially 
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constitutive in certain locations. 

Both heterologous and non-heterologous (i.e., endogenous) promoters 
can be employed to direct expression of the nucleic acids of the present invention. 
These promoters can also be used, for example, in recombinant expression cassettes to 
5 drive expression of antisense nucleic acids to reduce, increase, or alter lignin 
biosynthesis content and/or composition in a desired tissue. Thus, in some 
embodiments, the nucleic acid construct will comprise a promoter functional in a plant 
cell, such as in Zea mays, operably linked to a polynucleotide of the present invention. 
Promoters useful in these embodiments include the endogenous promoters driving 

10 expression of a polypeptide of the present invention. 

In some embodiments, isolated nucleic acids which serve as promoter or 
enhancer elements can be introduced in the appropriate position (generally upstream) of 
a non-heterologous form of a polynucleotide of the present invention so as to up or 
down regulate expression of a polynucleotide of the present invention. For example, 

15 endogenous promoters can be altered in vivo by mutation, deletion, and/or substitution 
(see, Kmiec, U.S. Patent 5,565,350; Zarling et al., PCT/US93/03868), or isolated 
promoters can be introduced into a plant cell in the proper orientation and distance from 
a lignin biosynthesis gene so as to control the expression of the gene. Gene expression 
can be modulated under conditions suitable for plant growth so as to alter lignin 

20 biosynthesis content and/or composition. Thus, the present invention provides 
compositions, and methods for making, heterologous promoters and/or enhancers 
operably linked to a native, endogenous (i.e., non-heterologous) form of a 
polynucleotide of the present invention. 

Methods for identifying promoters with a particular expression pattern, 

25 in terms of, e.g., tissue type, cell type, stage of development, and/or environmental 
conditions, are well known in the art. See, e.g., The Maize Handbook, Chapters 114- 
115, Freeling and Walbot, Eds., Springer, New York (1994); Corn and Corn 
Improvement, 3 ri edition, Chapter 6, Sprague and Dudley, Eds., American Society of 
Agronomy, Madison, Wisconsin (1988). A typical step in promoter isolation methods 

30 is identification of gene products that are expressed with some degree of specificity in 
the target tissue. Amongst the range of methodologies are: differential hybridization to 
cDNA libraries; subtractive hybridization; differential display; differential 2-D gel 
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electrophoresis; DNA probe arrays; and isolation of proteins known to be expressed 
with some specificity in the target tissue. Such methods are well known to those of 
skill in the art. Commercially available products for identifying promoters are known 
in the art such as C Ion tech's (Palo Alto, CA) Universal GenomeWalker Kit. 

5 For the protein-based methods, it is helpful to obtain the amino acid 

sequence for at least a portion of the identified protein, and then to use the protein 
sequence as the basis for preparing a nucleic acid that can be used as a probe to identify 
either genomic DNA directly, or preferably, to identify a cDNA clone from a library 
prepared from the target tissue. Once such a cDNA clone has been identified, that 

10 sequence can be used to identify the sequence at the 5' end of the transcript of the 
indicated gene. For differential hybridization, subtractive hybridization and differential 
display, the nucleic acid sequence identified as enriched in the target tissue is used to 
identify the sequence at the 5' end of the transcript of the indicated gene. Once such 
sequences are identified, starting either from protein sequences or nucleic acid 

15 sequences, any of these sequences identified as being from the gene transcript can be 
used to screen a genomic library prepared from the target organism. Methods for 
identifying and confirming the transcriptional start site are well known in the art. 

In the process of isolating promoters expressed under particular 
environmental conditions or stresses, or in specific tissues, or at particular 

20 developmental stages, a number of genes are identified that are expressed under the 
desired circumstances, in the desired tissue, or at the desired stage. Further analysis 
will reveal expression of each particular gene in one or more other tissues of the plant. 
One can identify a promoter with activity in the desired tissue or condition but that do 
not have activity in any other common tissue. 

25 To identify the promoter sequence, the 5 1 portions of the clones 

described here are analyzed for sequences characteristic of promoter sequences. For 
instance, promoter sequence elements include the TATA box consensus sequence 
(TATAAT), which is usually an AT-rich stretch of 5-10 bp located approximately 20 to 
40 base pairs upstream of the transcription start site. Identification of the TATA box is 

30 well known in the art. For example, one way to predict the location of this element is 
to identify the transcription start site using standard RNA-mapping techniques such as 
primer extension, SI analysis, and/or RNase protection. To confirm the presence of 
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the AT-rich sequence, a structure-function analysis can be performed involving 
mutagenesis of the putative region and quantification of the mutation's effect on 
expression of a linked downstream reporter gene. See, e.g., The Maize Handbook* 
Chapter 114, Freeling and Walbot, Eds., Springer, New York, (1994). 
5 In plants, further upstream from the TATA box, at positions -80 to -100, 

there is typically a promoter element (i.e., the CAAT box) with a series of adenines 
surrounding the trinucleotide G (or T) N G. J. Messing et al. f in Genetic Engineering 
in Plants, Kosage, Meredith and Hollaender, Eds., pp. 221-227 1983. In maize, there 
is no well conserved CAAT box but there are several short, conserved protein-binding 

10 motifs upstream of the TATA box. These include motifs for the trans-acting 
transcription factors involved in light regulation, anaerobic induction, hormonal 
regulation, or anthocyanin biosynthesis, as appropriate for each gene. 

Once promoter and/or gene sequences are known, a region of suitable 
size is selected from the genomic DNA that is 5* to the transcriptional start, or the 

IS translational start site, and such sequences are then linked to a coding sequence. If the 
transcriptional start site is used as the point of fusion, any of a number of possible 5* 
untranslated regions can be used in between the transcriptional start site and the partial 
coding sequence. If the translational start site at the 3' end of the specific promoter is 
used, then it is linked directly to the methionine start codon of a coding sequence. 

20 If polypeptide expression is desired, it is generally desirable to include a 

polyadenylation region at the 3'-end of a polynucleotide coding region. The 
polyadenylation region can be derived from the natural gene, from a variety of other 
plant genes, or from T-DNA. The 3' end sequence to be added can be derived from, 
for example, the nopaline synthase or octopine synthase genes, or alternatively from 

25 another plant gene, or less preferably from any other eukaryotic gene. 

An intron sequence can be added to the 5' untranslated region or the 
coding sequence of the partial coding sequence to increase the amount of the mature 
message that accumulates in the cytosol. Inclusion of a spliceable intron in the 
transcription unit in both plant and animal expression constructs has been shown to 

30 increase gene expression at both the mRNA and protein levels up to 1000-fold. 
Buchman and Berg, Mol. Cell Biol 8: 4395-4405 (1988); Callis et al., Genes Dev. 1: 
1183-1200 (1987). Such intron enhancement of gene expression is typically greatest 
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when placed near the 5* end of the transcription unit. Use of maize introns Adhl-S 
intron 1, 2, and 6, the Bronze- 1 intron are known in the art. See generally, The Maize 
Handbook, Chapter 116, Freeling and Walbot, Eds., Springer, New York (1994). 

The vector comprising the sequences from a polynucleotide of the 
S present invention will typically comprise a marker gene which confers a selectable 
phenotype on plant cells. Usually, the selectable marker gene will encode antibiotic 
resistance, with suitable genes including genes coding for resistance to the antibiotic 
spectinomycin (e.g., the aada gene), the streptomycin phosphotransferase (SPT) gene 
coding for streptomycin resistance, the neomycin phosphotransferase (NPTII) gene 

10 encoding kanamycin or geneticin resistance, the hygromycin phosphotransferase (HPT) 
gene coding for hygromycin resistance, genes coding for resistance to herbicides which 
act to inhibit the action of acetolactate synthase (ALS), in particular the 
sulfonylurea-type herbicides (e.g., the acetolactate synthase (ALS) gene containing 
mutations leading to such resistance in particular the S4 and/or Hra mutations), genes 

15 coding for resistance to herbicides which act to inhibit action of glutamine synthase, 
such as phosphinothricin or basta (e.g., the bar gene), or other such genes known in the 
art. The bar gene encodes resistance to the herbicide basta, the nptll gene encodes 
resistance to the antibiotics kanamycin and geneticin, and the ALS gene encodes 
resistance to the herbicide chlorsulfiiron. 

20 Typical vectors useful for expression of genes in higher plants are well 

known in the art and include vectors derived from the tumor-inducing (Ti) plasmid of 
Agrobacterium tumefaciens described by Rogers et aL, Meth. In Enzymol. , 153:253- 
277 (1987). These vectors are plant integrating vectors in that on transformation, the 
vectors integrate a portion of vector DNA into the genome of the host plant. 

25 Exemplary A. tumefaciens vectors useful herein are plasmid s pKYLX6 and pKYLX7 of 
Schardl et aL, Gene, 61:1-11 (1987) and Berger et al., Proc. Natl. Acad. Sci. U.S.A., 
86:8402-8406 (1989). Another useful vector herein is plasmid pBI101.2 that is 
available from Clontech Laboratories, Inc. (Palo Alto, CA). 

A polynucleotide of the present invention can be expressed in either 

30 sense or anti-sense orientation as desired. It will be appreciated that control of gene 
expression in either sense or anti-sense orientation can have a direct impact on the 
observable plant characteristics. Antdsense technology can be conveniently used to gene 
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expression in plants. To accomplish this, a nucleic acid segment from the desired gene 
is cloned and operably linked to a promoter such that the anti-sense strand of RNA will 
be transcribed. The construct is then transformed into plants and the anti sense strand of 
RNA is produced. In plant cells, it has been shown that antisense RNA inhibits gene 
5 expression by preventing the accumulation of mRNA which encodes the enzyme of 
interest, see, e.g., Sheehy etal., Proc. Nat'l. Acad. ScL (USA) 85: 8805-8809 (1988); 
and Hiatt et a/., U.S. Patent No. 4,801,340. 

Another method of suppression is sense suppression. Introduction of 
nucleic acid configured in the sense orientation has been shown to be an effective means 

10 by which to block the transcription of target genes. For an example of the use of this 
method to modulate expression of endogenous genes see, Napoli et a/., The Plant Cell 
2: 279-289 (1990) and U.S. Patent No. 5,034,323. 

Catalytic RNA molecules or ribozymes can also be used to inhibit 
expression of plant genes. It is possible to design ribozymes that specifically pair with 

15 virtually any target RNA and cleave the phosphodiester backbone at a specific location, 
thereby functionally inactivating the target RNA. In carrying out this cleavage, the 
ribozyme is not itself altered, and is thus capable of recycling and cleaving other 
molecules, making it a true enzyme. The inclusion of ribozyme sequences within 
antisense RNAs confers RNA-cleaving activity upon them, thereby increasing the 

20 activity of the constructs. The design and use of target RNA-specific ribozymes is 
described in Haseloff et al., Nature 334: 585-591 (1988). 

A variety of cross-linking agents, alkylating agents and radical 
generating species as pendant groups on polynucleotides of the present invention can be 
used to bind, label, detect, and/or cleave nucleic acids. For example, Vlassov, V. V., et 

25 aL, Nucleic Acids Res (1986) 14:4065-4076, describe covalent bonding of a angle- 
stranded DNA fragment with alkylating derivatives of nucleotides complementary to target 
sequences. A report of similar work by the same group is that by Knorre, D. G., et al., 
Biochimie (1985) 67:785-789. Iverson and Dervan also showed sequence-specific 
cleavage of single-stranded DNA mediated by incorporation of a modified nucleotide 

30 which was capable of activating cleavage (7 Am Chem Soc (1987) 109:1241-1243). 
Meyer, K B., et al., J Am Chem Soc (1989) 111:8517-8519, effect covalent crosslinking 
to a target nucleotide using an alkylating agent complementary to the single-stranded 
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target nucleotide sequence. A photoactivated crosslinking to single-stranded 
oligonucleotides mediated by psoralen was disclosed by Lee, B. L., et al.. Biochemistry 
(1988) 27:3 197-3203. Use of crosslinking in triple-helix forming probes was also disclosed 
by Home, et al, J Am Chem Soc (1990) 112:2435-2437. Use of N4, N4-ethanocytosine 
as an alkylating agent to crosslink to single-stranded oligonucleotides has also been 
described by Webb and Matteucci. J Am Chem Soc (1986) 108:2764-2765; Nucleic Acids 
Res (1986) 14:7661-7674; Feteritz et al, J. Am. Chem. Soc. 113:4000 (1991). Various 
compounds to bind, detect, label, and/or cleave nucleic acids are known in the art. See, for 
example, U.S. Patent Nos. 5,543,507; 5,672,593; 5,484,908; 5,256,648; and, 5,681941. 



Proteins 

The isolated proteins of the present invention comprise a polypeptide 
having at least 10 amino acids encoded by any one of the polynucleotides of the present 
invention as discussed more fully, supra, or polypeptides which are conservatively 
15 modified variants thereof. Exemplary polypeptide sequences are provided in SEQ ID 
NOS: 1-18 and 73-75. The proteins of the present invention or variants thereof can 
comprise any number of contiguous amino acid residues from a polypeptide of the 
present invention, wherein that number is selected from the group of integers consisting 
of from 10 to the number of residues in a full-length lignin biosynthesis polypeptide. 
20 Optionally, this subsequence of contiguous amino acids is at least 15, 20, 25, 30, 35, or 
40 amino acids in length, often at least 50, 60, 70, 80, or 90 amino acids in length. 
Further, the number of such subsequences can be any integer selected from the group 
consisting of from 1 to 20, such as 2, 3, 4, or 5. 

As those of skill will appreciate, the present invention includes 
25 catalytically active polypeptides of the present invention (i.e., enzymes). Catalytically 
active polypeptides have a specific activity at least 20%, 30%, or 40%, and preferably 
at least 50%, 60%, or 70%, and most preferably at least 80%, 90%, or 95% that of the 
native (non-synthetic), endogenous polypeptide. Further, the substrate specificity 
QUKJ is optionally substantially similar to the native (non-synthetic), endogenous 
30 polypeptide. Typically, the K„ will be at least 30%, 40%, or 50%, that of the native 
(non-synthetic), endogenous polypeptide; and more preferably at least 60%, 70%, 
80%, or 90%. Methods of assaying and quantifying measures of enzymatic activity and 
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substrate specificity (k €11 /K Tn ), are well known to those of skill in the art. 

Generally, the proteins of the present invention will, when presented as 
an immunogen, elicit production of an antibody specifically reactive to a polypeptide of 
the present invention encoded by a polynucleotide of the present invention as described, 
5 supra. Exemplary polypeptides include those which are full-length, such as those 
disclosed in SEQ ID NOS: 1-18 and 73-75. Further, the proteins of the present 
invention will not bind to antisera raised against a polypeptide of the present invention 
which has been fully immunosorbed with the same polypeptide. Immunoassays for 
determining binding are well known to those of skill in the art. A preferred 
10 immunoassay is a competitive immunoassay as discussed, infra. Thus, the proteins of 
the present invention can be employed as immunogens for constructing antibodies 
immunoreactive to a protein of the present invention for such exemplary utilities as 
immunoassays or protein purification techniques. 

IS Expression of Proteins in Host Cells 

Using the nucleic acids of the present invention, one may express a 
protein of the present invention in a recombinantly engineered cell such as bacteria, 
yeast, insect, mammalian, or preferably plant cells. The cells produce the protein in a 
non-natural condition (e.g., in quantity, composition, location, and/or time), because 

20 they have been genetically altered through human intervention to do so. 

It is expected that those of skill in the art are knowledgeable in the 
numerous expression systems available for expression of a nucleic acid encoding a 
protein of the present invention. No attempt to describe in detail the various methods 
known for the expression of proteins in prokaryotes or eukaryotes will be made. 

25 In brief summary, the expression of isolated nucleic acids encoding a 

protein of the present invention will typically be achieved by operably linking, for 
example, the DNA or cDNA to a promoter (which is either constitutive or inducible), 
followed by incorporation into an expression vector. The vectors can be suitable for 
replication and integration in either prokaryotes or eukaryotes. Typical expression 

30 vectors contain transcription and translation terminators, initiation sequences, and 
promoters useful for regulation of the expression of the DNA encoding a protein of the 
present invention. To obtain high level expression of a cloned gene, it is desirable to 
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construct expression vectors which contain, at the minimum, a strong promoter to direct 
transcription, a ribosome binding site for translational initiation, and a 
transcription/translation terminator. One of skill would recognize that modifications 
can be made to a protein of the present invention without diminishing its biological 

5 activity. Some modifications may be made to facilitate the cloning, expression, or 
incorporation of the targeting molecule into a fusion protein. Such modifications are 
well known to those of skill in the art and include, for example, a methionine added at 
the amino terminus to provide an initiation site, or additional amino acids {e.g., poly 
His) placed on either terminus to create conveniently located restriction sites or 

0 termination codons or purification sequences. 

A. Expression in Prokaryotes 

Prokaryotic cells may be used as hosts for expression. Prokaryotes most 
frequently are represented by various strains of E. coli; however, other microbial 

5 strains may also be used. Commonly used prokaryotic control sequences which are 
defined herein to include promoters for transcription initiation, optionally with an 
operator, along with ribosome binding site sequences, include such commonly used 
promoters as the beta lactamase (penicillinase) and lactose (lac) promoter systems 
(Chang et al., Nature 198:1056 (1977)), the tryptophan (trp) promoter system (Goeddel 

0 et al., Nucleic Acids Res. 8:4057 (1980)) and the lambda derived P L promoter and N- 
gene ribosome binding site (Shimatake et aL, Nature 292:128 (1981)). The inclusion 
of selection markers in DNA vectors transfected in E. coli is also useful. Examples of 
such markers include genes specifying resistance to ampicillin, tetracycline, or 
chloramphenicol. 

5 The vector is selected to allow introduction into the appropriate host cell. 

Bacterial vectors are typically of plasmid or phage origin. Appropriate bacterial cells 
are infected with phage vector particles or transfected with naked phage vector DNA. 
If a plasmid vector is used, the bacterial cells are transfected with the plasmid vector 
DNA. Expression systems for expressing a protein of the present invention are 

0 available using Bacillus sp. and Salmonella (Palva, et al. 9 Gene 22: 229-235 (1983); 
Mosbach, et al., Nature 302: 543-545 (1983)). 
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B. Expression in Eukaryotes 

A variety of eukaryotic expression systems such as yeast, insect cell 
lines, plant and mammalian cells, are known to those of skill in the art. As explained 
briefly below, a of the present invention can be expressed in these eukaryotic systems. 
In some embodiments, transformed/transfected plant cells, as discussed infra, are 
employed as expression systems for production of the proteins of the instant invention. 

Synthesis of heterologous proteins in yeast is well known. Sherman, F., 
et al., Methods in Yeast Genetics, Cold Spring Harbor Laboratory (1982) is a well 
recognized work describing the various methods available to produce the protein in 
yeast. Suitable vectors usually have expression control sequences, such as promoters, 
including 3-phosphoglycerate kinase or other glycolytic enzymes, and an origin of 
replication, termination sequences and the like as desired. For instance, suitable 
vectors are described in the literature (Botstein, et al., Gene 8: 17-24 (1979); Broach, 
etai., Gene 8: 121-133 (1979)). 
15 A Protein of the present invention, once expressed, can be isolated from 

yeast by lysing the cells and applying standard protein isolation techniques to the 
lysates. The monitoring of the purification process can be accomplished by using 
Western blot techniques or radioimmunoassay of other standard immunoassay 
techniques. 

The sequences encoding proteins of the present invention can also be 
ligated to various expression vectors for use in transfecting cell cultures of, for 
instance, mammalian, insect, or plant origin. Illustrative of cell cultures useful for the 
production of the peptides are mammalian cells. Mammalian cell systems often will be 
in the form of monolayers of cells although mammalian cell suspensions may also be 
used. A number of suitable host cell lines capable of expressing intact proteins have 
been developed in the art, and include the HEK293, BHK21, and CHO cell lines. 
Expression vectors for these cells can include expression control sequences, such as an 
origin of replication, a promoter (e.g., the CMV promoter, a HSV tk promoter or pgk 
(phosphoglycerate kinase) promoter), an enhancer (Queen et al., Immunol. Rev. 89: 49 
(1986)), and necessary processing information sites, such as ribosome binding sites, 
RNA splice sites, polyadenylation sites (e.g., an SV40 large T Ag poly A addition 
site), and transcriptional terminator sequences. Other animal cells useful for production 
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of proteins of the present invention are available, for instance, from the American Type 
Culture Collection Catalogue of Cell Lines and Hybridomas (7th edition, 1992). 

Appropriate vectors for expressing proteins of the present invention in 
insect cells are usually derived from the SF9 baculovirus. Suitable insect cell lines 

5 include mosquito larvae, silkworm, armyworm, moth and Drosophila cell lines such as 
a Schneider cell line (See Schneider, /. EmbiyoL Exp. Morphol 27: 353-365 (1987). 

As with yeast, when higher animal or plant host cells are employed, 
polyadenlyation or transcription terminator sequences are typically incorporated into the 
vector. An example of a terminator sequence is the polyadenlyation sequence from the 

0 bovine growth hormone gene. Sequences for accurate splicing of the transcript may 
also be included. An example of a splicing sequence is the VP1 intron from SV40 
(Sprague, et al. 9 7. Virol 45: 773-781 (1983)). Additionally, gene sequences to 
control replication in the host cell may be incorporated into the vector such as those 
found in bovine papilloma virus type-vectors. Saveria-Campo, M., Bovine Papilloma 

5 Virus DNA a Eukaryotic Cloning Vector in DNA Cloning Vol II a Practical Approach, 
D.M. Glover, Ed., IRL Press, Arlington, Virginia pp. 213-238 (1985). 

Tranrf^l^n^IYflwicpf ormation of CglUf 

The method of transformation/transfection is not critical to the instant 
D invention; various methods of transformation or transfection are currently available. As 
newer methods are available to transform crops or other host cells they may be directly 
applied. Accordingly, a wide variety of methods have been developed to insert a DNA 
sequence into the genome of a host cell to obtain the transcription and/or translation of 
the sequence to effect phenotypic changes in the organism. Thus, any method which 
5 provides for efficient transformation/transfection may be employed. 

A. Plant Transformation 

A DNA sequence coding for the desired polynucleotide of the present 
invention, for example a cDNA or a genomic sequence encoding a full length protein, 
3 will be used to construct a recombinant expression cassette which can be introduced into 
the desired plant 

Isolated nucleic acid acids of the present invention can be introduced into 
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plants according techniques known in the art. Generally, recombinant expression 
cassettes as described above and suitable for transformation of plant cells are prepared. 
Techniques for transforming a wide variety of higher plant species are well known and 
described in the technical, scientific, and patent literature. See, for example, Weising 
5 et al. f Ann. Rev. Genet. 22: 421-477 (1988). For example, the DNA construct may be 
introduced directly into the genomic DNA of the plant cell using techniques such as 
electroporation, PEG poration, particle bombardment, silicon fiber delivery, or 
microinjection of plant cell protoplasts or embryogenic callus. Alternatively, the DNA 
constructs may be combined with suitable T-DNA flanking regions and introduced into 

10 a conventional Agrobacterium twnefadens host vector. The virulence functions of the 
Agrobacteriwn twnefaciens host will direct the insertion of the construct and adjacent 
marker into the plant cell DNA when the cell is infected by the bacteria. 

The introduction of DNA constructs using polyethylene glycol 
precipitation is described in Paszkowski et aL, Embo J. 3: 2717-2722 (1984), 

15 Electroporation techniques are described in Fromm et d., Proc. Natl Acad. Sci. 82: 
5824 (1985). Ballistic transformation techniques are described in Klein et aL, Nature 
327: 70-73 (1987). 

Agrobacterium tumefaciens-meditated transformation techniques are well described in 
the scientific literature. See, for example Horsch et a/., Science 233: 496-498 (1984), 
20 and Fraley et al., Proc. Natl. Acad. Sci. 80: 4803 (1983). Although Agrobacterium is 
useful primarily in dicots, certain monocots can be transformed by Agrobacterium. For 
instance, Agrobacterium transformation of maize is described in U.S. Patent No. 
5,550,318. 

Other methods of transfection or transformation include (1) 
25 Agrobacterium rhizogenes-mediated transformation (see, e.g., Lichtenstein and Fuller 
In: Genetic Engineering, vol. 6, PWJ Rigby, Ed., London, Academic Press, 1987; and 
Lichtenstein, C. P., and Draper, J,. In: DNA Cloning, Vol. H, D. M. Glover, Ed., 
Oxford, IRI Press, 1985), Application PCT/US87/02512 (WO 88/02405 published Apr. 
7, 1988) describes the use of A.rhizogenes strain A4 and its Ri plasmid along with A. 
30 twnefaciens vectors pARC8 or pARC16 (2) liposome-mediated DNA uptake (see, e.g., 
Freeman et al., Plant Cell Physiol. 25: 1353, 1984), (3) the vortexing method (see, 
e.g., Kindle, Proc. Natl. Acad. ScL, USA 87: 1228, (1990). 
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DNA can also be introduced into plants by direct DNA transfer into 
pollen as described by Zhou et al. f Methods in Enzymology, 101:433 (1983); D. Hess, 
Intern Rev. CytoL, 107:367 (1987); Luo et al. 9 Plane MoL Biol. Reporter, 6:165 
(1988). Expression of polypeptide coding genes can be obtained by injection of the 

5 DNA into reproductive organs of a plant as described by Pena et al., Nature, 325.:274 
(1987). DNA can also be injected directly into the cells of immature embryos and the 
rehydration of desiccated embryos as described by Neuhaus et al., Theor. Appl. 
Genet., 75:30 (1987); and Benbrook et al., in Proceedings Bio Expo 1986, 
Butterworth, Stoneham, Mass., pp. 27-54 (1986). A variety of plant viruses that can 

10 be employed as vectors are known in the art and include cauliflower mosaic virus 
(CaMV), geminivirus, brome mosaic virus, and tobacco mosaic vims. 

B. Transection of Prokaryotes, Lower Eukaryotes, and Animal Cells 

Animal and lower eukaryotic (e.g., yeast) host cells are competent or 

15 rendered competent for transfection by various means. There are several well-known 
methods of introducing DNA into animal cells. These include: calcium phosphate 
precipitation, fusion of the recipient cells with bacterial protoplasts containing the 
DNA, treatment of the recipient cells with liposomes containing the DNA, DEAE 
dextran, electroporation, biolistics, and micro-injection of the DNA directly into the 

20 cells. The transfected cells are cultured by means well known in the art. Kuchler, R.J., 
Biochemical Methods in Cell Culture and Virology, Dowden, Hutchinson and Ross, 
Inc. (1977). 

Synthesis of Proteins 

25 The proteins of the present invention can be constructed using non- 

cellular synthetic methods. Solid phase synthesis of proteins of less than about 50 
amino acids in length may be accomplished by attaching the C-terminal amino acid of 
the sequence to an insoluble support followed by sequential addition of the remaining 
amino acids in the sequence. Techniques for solid phase synthesis are described by 

30 Barany and Merrifield, Solid-Phase Peptide Synthesis, pp. 3-284 in The Peptides: 
Analysis, Synthesis, Biology. Vol. 2: Special Methods in Peptide Synthesis, Part A.\ 
Merrifield, et al., J. Am. Chem. Soc. 85: 2149-2156 (1963), and Stewart et al, Solid 
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Phase Peptide Synthesis, 2nd ed., Pierce Chem. Co., Rockford, El. (1984). Proteins 
of greater length may be synthesized by condensation of the amino and carboxy termini 
of shorter fragments. Methods of forming peptide bonds by activation of a carboxy 
terminal end (e.g., by the use of the coupling reagent N,N'- 
5 dicycylohexylcarbodiimide)) is known to those of skill. 

Purification of Proteins 

The proteins of the present invention may be purified by standard 
techniques well known to those of skill in the art. Recombinantly produced proteins of 
10 the present invention can be directly expressed or expressed as a fusion protein. The 
recombinant protein is purified by a combination of cell lysis (e.g., sonication, French 
press) and affinity chromatography. For fusion products, subsequent digestion of the 
fusion protein with an appropriate proteolytic enzyme releases the desired recombinant 
protein. 

15 The proteins of this invention, recombinant or synthetic, may be purified 

to substantial purity by standard techniques well known in the art, including selective 
precipitation with such substances as ammonium sulfate, column chromatography, 
immunopurification methods, and others. See, for instance, R. Scopes, Protein 
Purification: Principles and Practice, Springer- Verlag: New York (1982); Deutscher, 

20 Guide to Protein Purification, Academic Press (1990). For example, antibodies may be 
raised to the proteins as described herein. Purification from E. coli can be achieved 
following procedures described in U.S. Patent No. 4,511,503. The protein may then 
be isolated from cells expressing the protein and further purified by standard protein 
chemistry techniques as described herein. Detection of the expressed protein is 

25 achieved by methods known in the art and include, for example, radioimmunoassays, 
Western blotting techniques or immunoprecipitation. 

Transgenic Plant Regeneration 

Transformed plant cells which are derived by any of the above 
30 transformation techniques can be cultured to regenerate a whole plant which possesses 
the transformed genotype. Such regeneration techniques often rely on manipulation of 
certain phytohormones in a tissue culture growth medium, typically relying on a biocide 



WO 99/10498 



- 56 - 



PCT/US98/17519 



and/or herbicide marker which has been introduced together with a polynucleotide of 
the present invention. 

Plants cells transformed with a plant expression vector can be 
regenerated, e.g., from single cells, callus tissue or leaf discs according to standard 
5 plant tissue culture techniques. It is well known in the art that various cells, tissues, 
and organs from almost any plant can be successfully cultured to regenerate an entire 
plant. Plant regeneration from cultured protoplasts is described in Evans et al., 
Protoplasts Isolation and Culture, Handbook of Plant Cell Culture, Macmillilan 
Publishing Company, New York, pp. 124-176 (1983); and Binding, Regeneration of 

10 Plants, Plant Protoplasts, CRC Press, Boca Raton, pp. 21-73 (1985). 

The regeneration of plants containing the foreign gene introduced by 
Agrobacteriwn from leaf explants can be achieved as described by Horsch et al., 
Science, 227:1229-1231 (1985). In this procedure, transformants are grown in the 
presence of a selection agent and in a medium that induces the regeneration of shoots in 

15 the plant species being transformed as described by Fraley et al., Proc. Natl. Acad. Sci. 
U.S.A., 80:4803 (1983). This procedure typically produces shoots within two to four 
weeks and these transformant shoots are then transferred to an appropriate root-inducing 
medium containing the selective agent and an antibiotic to prevent bacterial growth. 
Transgenic plants of the present invention may be fertile or sterile. 

20 Regeneration can also be obtained from plant callus, explants, organs, or 

parts thereof. Such regeneration techniques are described generally in Klee et al., Ann. 
Rev. of Plant Phys. 38: 467-486 (1987). The regeneration of plants from either single 
plant protoplasts or various explants is well known in the art. See, for example, 
Methods for Plant Molecular Biology, A. Weissbach and H. Weissbach, eds., 

25 Academic Press, Inc., San Diego, Calif. (1988). This regeneration and growth process 
includes the steps of selection of transformant cells and shoots, rooting the transformant 
shoots and growth of the plantlets in soil. For maize cell culture and regeneration see 
generally, The Maize Handbook, Freeling and Walbot, Eds., Springer, New York 
(1994); Corn and Corn Improvement, 3 rd edition, Sprague and Dudley Eds., American 

30 Society of Agronomy, Madison, Wisconsin (1988). 

One of skill will recognize that after the recombinant expression cassette 
is stably incorporated in transgenic plants and confirmed to be operable, it can be 
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introduced into other plants by sexual crossing. Any of a number of standard breeding 
techniques can be used, depending upon the species to be crossed. 

In vegetatively propagated crops, mature transgenic plants can be 
propagated by the taking of cuttings or by tissue culture techniques to produce multiple 
5 identical plants. Selection of desirable transgenics is made and new varieties are 
obtained and propagated vegetatively for commercial use. In seed propagated crops, 
mature transgenic plants can be self crossed to produce a homozygous inbred plant. 
The inbred plant produces seed containing the newly introduced heterologous nucleic 
acid. These seeds can be grown to produce plants that would produce the selected 

10 phenotype, (e.g., altered lignin biosynthesis content or composition). 

Parts obtained from the regenerated plant, such as flowers, seeds, leaves, 
branches, fruit, and the like are included in the invention, provided that these parts 
comprise cells comprising the isolated nucleic acid of the present invention. Progeny 
and variants, and mutants of the regenerated plants are also included within the scope of 

15 the invention, provided that these parts comprise the introduced nucleic acid sequences. 

Transgenic plants expressing the selectable marker can be screened for 
transmission of the nucleic acid of the present invention by, for example, standard 
immunoblot and DNA detection techniques. Transgenic lines are also typically 
evaluated on levels of expression of the heterologous nucleic acid. Expression at the 

20 RNA level can be determined initially to identify and quantitate expression-positive 
plants. Standard techniques for RNA analysis can be employed and include PCR 
amplification assays using oligonucleotide primers designed to amplify only the 
heterologous RNA templates and solution hybridization assays using heterologous 
nucleic acid-specific probes. The RNA-positive plants can then analyzed for protein 

25 expression by Western immunoblot analysis using the specifically reactive antibodies of 
the present invention. In addition, in situ hybridization and immunocytochemistry 
according to standard protocols can be done using heterologous nucleic acid specific 
polynucleotide probes and antibodies, respectively, to localize sites of expression within 
transgenic tissue. Generally, a number of transgenic lines are usually screened for the 

30 incorporated nucleic acid to identify and select plants with the most appropriate 
expression profiles. 

A preferred embodiment is a transgenic plant that is homozygous for the 
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added heterologous nucleic acid; i.e., a transgenic plant that contains two added nucleic 
acid sequences, one gene at the same locus on each chromosome of a chromosome pair. 
A homozygous transgenic plant can be obtained by sexually mating (selfing) a 
heterozygous transgenic plant that contains a single added heterologous nucleic acid, 
5 germinating some 

of the seed produced and analyzing the resulting plants produced for altered 
lignification relative to a control plant (i.e., native, non-transgenic). Back-crossing to a 
parental plant and out-crossing with a non- transgenic plant are also contemplated. 

10 Modulating lignin biosynthesis Cont ent and/or Composition 

The present invention further provides a method for modulating (i.e., 
increasing or decreasing) lignin biosynthesis content or composition in a plant or part 
thereof. Modulation can be effected by increasing or decreasing the lignin biosynthesis 
content (i.e., the total amount of lignin biosynthesis) and/or the lignin biosynthesis 

15 composition (the ratio of various lignin biosynthesis monomers in the plant) in a plant. 
The method comprises transforming a plant cell with a recombinant expression cassette 
comprising a polynucleotide of the present invention as described above to obtain a 
transformed plant cell, growing the transformed plant cell under plant forming 
conditions, and inducing expression of a polynucleotide of the present invention in the 

20 plant for a time sufficient to modulate lignin biosynthesis content and/or composition in 
the plant or plant part. 

In some embodiments, lignification in a plant may be modulated by 
altering, in vivo or in vitw, the promoter of a non-isolated lignin biosynthesis gene to 
up- or down-regulate gene expression. In some embodiments, the coding regions of 

25 native lignin biosynthesis genes can be altered via substitution, addition, insertion, or 
deletion to decrease activity of the encoded enzyme. See, e.g., Kmiec, U.S. Patent 
5,565,350; Zarling et a/., PCT/US93/03868. And in some embodiments, an isolated 
nucleic acid (e.g., a vector) comprising a promoter sequence is transfected into a plant 
cell. Subsequently, a plant cell comprising the promoter operably linked to a 

30 polynucleotide of the present invention is selected for by means known to those of skill 
in the art such as, but not limited to, Southern blot, DNA sequencing, or PCR analysis 
using primers specific to the promoter and to the gene and detecting amplicons 
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produced therefrom. A plant or plant part altered or modified by the foregoing 
embodiments is grown under plant forming conditions for a time sufficient to modulate 
lignin biosynthesis content and/or composition in the plant. Plant forming conditions 
are well known in the art and discussed briefly, supra. 

In general, content or composition is increased or decreased by at least 
596, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, or 90% relative to a native 
control plant, plant part, or cell lacking the aforementioned recombinant expression 
cassette. Modulation in the present invention may occur during and/or subsequent to 
growth of the plant to the desired stage of development. Modulating nucleic acid 
expression temporally and/or in particular tissues can be controlled by employing the 
appropriate promoter operably linked to a polynucleotide of the present invention in, 
for example, sense or antisense orientation as discussed in greater detail, supra. 
Induction of expression of a polynucleotide of the present invention can also be 
controlled by exogenous administration of an effective amount of inducing compound. 
15 Inducible promoters and inducing compounds which activate expression from these 
promoters are well known in the art. In preferred embodiments, lignification is 
modulated in monocots, particularly maize. 



10 



20 



Molecular M^rkgES 

The present invention provides a method of genotyping a plant 
comprising a polynucleotide of the present invention. Preferably, the plant is a 
monocot, such as maize or sorghum. Genotyping provides a means of distinguishing 
homologs of a chromosome pair and can be used to differentiate segregants in a plant 
population. Molecular marker methods can be used for phylogenetic studies, 
25 characterizing genetic relationships among crop varieties, identifying crosses or somatic 
hybrids, localizing chromosomal segments affecting monogenic traits, map based 
cloning, and the study of quantitative inheritance. See, e.g., Plant Molecular Biology: 
A Laboratory Manual, Chapter 7, Clark, Ed., Springer-Verlag, Berlin (1997). For 
molecular marker methods, see generally, The DNA Revolution by Andrew H. 
Paterson 1996 (Chapter 2) in: Genome Mapping in Plants (ed. Andrew H. Paterson) by 
Academic Press/R. G. Landis Company, Austin, Texas, pp. 7-21. 

The particular method of genotyping in the present invention may 
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employ any number of molecular marker analytic techniques such as, but not limited to, 
restriction fragment length polymorphisms (RFLPs). RFLPs are the product of allelic 
differences between DNA restriction fragments caused by nucleotide sequence 
variability. As is well known to those of skill in the art, RFLPs are typically detected 
5 by extraction of genomic DNA and digestion with a restriction enzyme. Generally, the 
resulting fragments are separated according to size and hybridized with a probe; single 
copy probes are preferred. Restriction fragments from homologous chromosomes are 
revealed. Differences in fragment size among alleles represent an RFLP. Thus, the 
present invention further provides a means to follow segregation of a lignin biosynthesis 

10 gene or nucleic acid of the present invention as well as chromosomal sequences 
genetically linked to these genes or nucleic acids using such techniques as RFLP 
analysis. Linked chromosomal sequences are within 50 centiMorgans (cM), often 
within 40 or 30 cM, preferably within 20 or 10 cM, more preferably within 5, 3, 2, or 
1 cM of a lignin biosynthesis gene. 

15 In the present invention, the nucleic acid probes employed for molecular 

marker mapping of plant nuclear genomes selectively hybridize, under selective 
hybridization conditions, to a gene encoding a polynucleotide of the present invention. 
In preferred embodiments, the probes are selected from polynucleotides of the present 
invention. Typically, these probes are cDNA probes or Pst I genomic clones. The 

20 length of the probes is discussed in greater detail, supra, but are typically at least 15 
bases in length, more preferably at least 20, 25, 30, 35, 40, or 50 bases in length. 
Generally, however, the probes are less than about 1 kilobase in length. Preferably, the 
probes are single copy probes that hybridize to a unique locus in a haploid chromosome 
complement. Some exemplary restriction enzymes employed in RFLP mapping are 

25 EcoRI, EcoRv, and Sstl. As used herein the term "restriction enzyme" includes 
reference to a composition that recognizes and, alone or in conjunction with another 
composition, cleaves at a specific nucleotide sequence. 



genomic DNA of a plant with a restriction enzyme; (b) hybridizing a nucleic acid 
30 probe, under selective hybridization conditions, to a sequence of a polynucleotide of the 
present of said genomic DNA; (c) detecting therefrom a RFLP. Other methods of 
differentiating polymorphic (allelic) variants of polynucleotides of the present invention 



The method of detecting an RFLP comprises the steps of (a) digesting 
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can be had by utilizing molecular marker techniques well known to those of skill in the 
art including such techniques as: 1) single stranded conformation analysis (SSCP); 2) 
denaturing gradient gel electrophoresis (DGGE); 3) RNase protection assays; 4) allele- 
specific oligonucleotides (ASOs); 5) the use of proteins which recognize nucleotide 
mismatches, such as the E. coti mutS protein; and 6) allele-specific PCR. Other 
approaches based on the detection of mismatches between the two complementary DNA 
strands include clamped denaturing gel electrophoresis (CDGE); heteroduplex analysis 
(HA); and chemical mismatch cleavage (CMC). Exemplary polymorphic variants are 
provided in Table I, supra. Thus, the present invention further provides a method of 
genotyping comprising the steps of contacting, under stringent hybridization conditions, 
a sample suspected of comprising a polynucleotide of the present invention with a 
nucleic acid probe. Generally, the sample is a plant sample; preferably, a sample 
suspected of comprising a maize polynucleotide of the present invention (e.g., gene, 
mRNA). The nucleic acid probe selectively hybridizes, under stringent conditions, to a 
subsequence of a polynucleotide of the present invention comprising a polymorphic 
marker. Selective hybridization of the nucleic acid probe to the polymorphic marker 
nucleic acid sequence yields a hybridization complex. Detection of the hybridization 
complex indicates the presence of that polymorphic marker in the sample. In preferred 
embodiments, the nucleic acid probe comprises a polynucleotide of the present 
20 invention. 



15 



UTR's and TnHnn Preference 

In general, translational efficiency has been found to be regulated by 
specific sequence elements in the 5' non-coding or untranslated region (5' UTR) of the 
25 RNA Positive sequence motifs include translational initiationconsensus sequences 
(Kozak, Nucleic Acids Res. 15:8125 (1987)) and the 5<G> 7 methyl GpppG cap structure 
(Drummondera/., Nucleic Acids Res. 13:7375(1985)). Negative elements include stable 
intramolecular 5' UTR stem-loop structures (Muesing et at., Cell 48:691 (1987)) and 
AUG sequences or short open reading frames preceded by an appropriate AUG in the 5' 
UTR (Kozak, supra, Rao et al., Mol. and Cell Biol. 8:284 (1988)). Accordingly, the 
present invention provides 5' and/or 3' UTR regions for modulation of translation of 
heterologous coding sequences. 
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Further, the polypeptide-encoding segments of the polynucleotides of the 
present invention can be modified to alter codon usage. Altered codon usage can be 
employed to alter translational efficiency and/or to optimize the coding sequence for 
expression in a desired host or to optimize the codon usage in a heterologous sequence for 
5 expression in maize. Codon usage in the coding regions of the polynucleotides of the 
present invention can be analyzed statistically using commercially available software 
packages such as "Codon Preference" available from the University of Wisconsin Genetics 
Computer Group (see Devereaux et al, Nucleic Acids Res. 12: 387-395 (1984)) or 
MacVector 4.1 (Eastman Kodak Co., New Haven, Conn.). Thus, the present invention 

10 provides a codon usage frequency characteristic of the coding region of at least one of the 
polynucleotides of the present invention. The number of polynucleotides that can be used 
to determine a codon usage frequency can be any integer from 1 to the number of 
polynucleotides of the present invention as provided herein. Optionally, the 
polynucleotides will be full-length sequences. An exemplary number of sequences for 

15 statistical analysis can be at least 1, 5, 10, 20, 50, or 100. 
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Sequence Shuffling 

The present invention provides methods for sequence shuffling using 
polynucleotides of the present invention, and compositions resulting therefrom. 
Sequence shuffling is described in PCT publication No. 96/19256. See also, Zhang, J.- 
H., etal. Proc. Natl. Acad. Sci. USA 94:4504-4509 (1997). Generally, sequence shuffling 
provides a means for generating libraries of polynucleotides having a desired characteristic 
which can be selected or screened for. Libraries of recombinant polynucleotides are 
generated from a population of related sequence polynucleotides which comprise 
sequence regions which have substantial sequence identity and can be homologously 
recombined in vitro or in vivo. The population of sequence-recombined polynucleotides 
comprises a subpopulation of polynucleotides which possess desired or advantageous 
characteristics and which can be selected by a suitable selection or screening method. The 
characteristics can be any property or attribute capable of being selected for or detected in 
a screening system, and may include properties of: an encoded protein, a transcriptional 
element, a sequence controlling transcription, RNA processing, RNA stability, chromatin 
conformation, translation, or other expression property of a gene or transgene, a 
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replicative dement, a protein-binding element, or the like, such as any feature which 
confers a selectable or detectable property. In some embodiments, the selected 
characteristic will be an increased K» and/or over the wild-type protein as provided 
herein. In other embodiments, a protein or polynculeotide generated from sequence 
5 shuffling will have a ligand binding affinity greater than the non-shuffled wild-type 
polynucleotide. The increase in such properties can be at least 1 10%, 120%, 130%, 140% 
or at least 150% of the wild-type value. 
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Detection of Nucleic Arfrfs 

The present invention further provides methods for detecting a 
polynucleotide of the present invention in a nucleic acid sample suspected of comprising 
a polynucleotide of the present invention, such as a plant cell lysate, particularly a 
lysate of corn. In some embodiments, a lignin biosynthesis gene or portion thereof can 
be amplified prior to the step of contacting the nucleic acid sample with a 
polynucleotide of the present invention. The nucleic acid sample is contacted with the 
polynucleotide to form a hybridization complex. The polynucleotide hybridizes under 
stringent conditions to a gene encoding a polypeptide of the present invention. 
Formation of the hybridization complex is used to detect a gene encoding a polypeptide 
of the present invention in the nucleic acid sample. Those of skill will appreciate that 
an isolated nucleic acid comprising a polynucleotide of the present invention should 
lack cross-hybridizing sequences in common with non-lignin biosynthesis genes that 
would yield a false positive result 

Detection of the hybridization complex can be achieved using any 
number of well known methods. For example, the nucleic acid sample, or a portion 
thereof, may be assayed by hybridization formats including but not limited to, solution 
phase, solid phase, mixed phase, or in situ hybridization assays. Briefly, in solution (or 
liquid) phase hybridizations, both the target nucleic acid and the probe or primer are 
free to interact in the reaction mixture. In solid phase hybridization assays, probes or 
primers are typically linked to a solid support where they are available for hybridization 
with target nucleic in solution. In mixed phase, nucleic acid intermediates in solution 
hybridize to target nucleic acids in solution as well as to a nucleic acid linked to a solid 
support. In in situ hybridization, the target nucleic acid is liberated from its cellular 
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surroundings in such as to be available for hybridization within the cell while 
preserving the cellular morphology for subsequent interpretation and analysis. The 
following articles provide an overview of the various hybridization assay formats: 
Singer et al., Biotechniques 4(3): 230-250 (1986); Haase et al., Methods in Virology, 
5 Vol. VII, pp. 189-226 (1984); Wilkinson, The theory and practice of in situ 
hybridization in: In situ Hybridization, D.G. Wilkinson, Ed., IRL Press, Oxford 
University Press, Oxford; and Nucleic Acid Hybridization: A Practical Approach, 
Hames, B.D. and Higgins, S.J., Eds., IRL Press (1987). 

10 NwleiC Arid labels and Detection MpfhnHc 

The means by which nucleic acids of the present invention are labeled is 
not a critical aspect of the present invention and can be accomplished by any number of 
methods currently known or later developed. Detectable labels suitable for use in the 
present invention include any composition detectable by spectroscopic, radioisotopic, 

15 photochemical, biochemical, immunochemical, electrical, optical or chemical means. 
Useful labels in the present invention include biotin for staining with labeled 
streptavidin conjugate, magnetic beads, fluorescent dyes (e.g., fluorescein, texas red, 
rhodamine, green fluorescent protein, and the like), radiolabels (e.g., 3 H, ,2S I, JS S, M C, 
or M P), enzymes (e.g., horse radish peroxidase, alkaline phosphatase and others 

20 commonly used in an ELISA), and colorimetric labels such as colloidal gold or colored 
glass or plastic (e.g., polystyrene, polypropylene, latex, etc.) beads. 

Nucleic acids of the present invention can be labeled by any one of 
several methods typically used to detect the presence of hybridized nucleic acids. One 
common method of detection is the use of autoradiography using probes labeled with 

25 S H, '"I, M S, M C, or M P, or the like. The choice of radio-active isotope depends on 
research preferences due to ease of synthesis, stability, and half lives of the selected 
isotopes. Other labels include ligands which bind to antibodies labeled with 
fluorophores, cheimluminescent agents, and enzymes. Alternatively, probes can be 
conjugated directly with labels such as fluorophores, chenuluminescent agents or 

30 enzymes. The choice of label depends on sensitivity required, ease of conjugation with 
the probe, stability requirements, and available instrumentation. Labeling th e nucleic 
acids of the present invention is readily achieved such as by the use of labeled PCR 
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primers. 

In some embodiments, the label is simultaneously incorporated during 
the amplification step in the preparation of the nucleic acids. Thus, for example, 
polymerase chain reaction (PCR) with labeled primers or labeled nucleotides will 
5 provide a labeled amplification product. In another embodiment, transcription 
amplification using a labeled nucleotide (e.g., fluorescein-labeled UTP and/or CTP) 
incorporates a label into the transcribed nucleic acids. 

Non-radioactive probes are often labeled by indirect means. For 
example, a ligand molecule is covalently bound to the probe. The ligand then binds to 

10 an anti-ligand molecule which is either inherently detectable or covalently bound to a 
detectable signal system, such as an enzyme, a fluorophore, or a chemiluminescent 
compound. Enzymes of interest as labels will primarily be hydrolases, such as 
phosphatases, esterases and glycosidases, or oxidoreductases, particularly peroxidases. 
Fluorescent compounds include fluorescein and its derivatives, rhodamine and its 

15 derivatives, dansyl, umbelliferone, etc. Chemiluminescers include luciferin, and 2,3- 
dihydrophthalazmediones, e.g., luminol. Iigands and anti-ligands may be varied 
widely. Where a ligand has a natural anti-ligand, namely Iigands such as biotin, 
thyroxine, and Cortisol, it can be used in conjunction with its labeled, naturally 
occurring anti-ligands. Alternatively, any haptenic or antigenic compound can be used 

20 in combination with an antibody. 

Probes can also be labeled by direct conjugation with a label. For 
example, cloned DNA probes have been coupled directly to horseradish peroxidase or 
alkaline phosphatase, (Renz. M., and Kurz, K., A Colorimetric Method for DNA 
Hybridization, Nucl. Acids Res. 12: 3435-3444 (1984)) and synthetic oligonucleotides 

25 have been coupled direcdy with alkaline phosphatase (Jablonski, E. , et al. , Preparation 
of Oligodeoxynucleotide-Alkaline Phosphatase Conjugates and Their Use as 
Hybridization Probes, Nuc. Acids. Res. 14: 6115-6128 (1986); and Li P., et al., 
Enzyme-linked Synthetic Oligonucleotide probes: Non-Radioactive Detection of 
Enterotoxigenic Escherichia Coli in Faeca Specimens, Nucl. Acids Res. 15: 5275-5287 

30 (1987)). 

Means of detecting such labels are well known to those of skill in the art. 
Thus, for example, radiolabels may be detected using photographic film or scintillation 
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counters, fluorescent markers may be detected using a photodetector to detect emitted 
light. Enzymatic labels are typically detected by providing the enzyme with a substrate 
and detecting the reaction product produced by the action of the enzyme on the 
substrate, and colorimetric labels are detected by simply visualizing the colored label. 

5 

Antibodies to Proteins 

Antibodies can be raised to a protein of the present invention, including 
individual, allelic, strain, or species variants, and fragments thereof, both in their 
naturally occurring (full-length) forms and in recombinant forms. Additionally, 
antibodies are raised to these proteins in either their native configurations or in non- 
native configurations. Anti-idiotypic antibodies can also be generated. Many methods 
of making antibodies are known to persons of skill. The following discussion is 
presented as a general overview of the techniques available; however, one of skill will 
recognize that many variations upon the following methods are known. 

A number of immunogens are used to produce antibodies specifically 
reactive with a protein of the present invention. An isolated recombinant, synthetic, or 
native lignin biosynthesis protein of 5 amino acids in length or greater and selected 
from a protein encoded by a polynucleotide of the present invention, such as exemplary 
sequences of SEQ ID NOS: 1-18 and 73-75, are the preferred immunogens (antigen) 
for the production of monoclonal or polyclonal antibodies. Those of skill will readily 
understand that the proteins of the present invention are typically denatured, and 
optionally reduced, prior to formation of antibodies for screening expression libraries or 
other assays in which a putative protein of the present invention is expressed or 
denatured in a non-native secondary, tertiary, or quartenary structure. Naturally 
occurring lignin biosynthesis polypeptides can be used either in pure or impure form. 

The protein of the present invention is then injected into an animal 
capable of producing antibodies. Either monoclonal or polyclonal antibodies can be 
generated for subsequent use in immunoassays to measure the presence and quantity of 
the protein of the present invention. Methods of producing polyclonal antibodies are 
known to those of skill in the art. In brief, an immunogen (antigen), preferably a 
purified protein, a protein coupled to an appropriate carrier (e.g., GST, keyhole limpet 
hemanocyanin, etc.), or a protein incorporated into an immunization vector such as a 
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recombinant vaccinia virus {see, U.S. Patent No. 4,722,848) is mixed with an adjuvant 
and animals are immunized with the mixture. The animal's immune response to the 
immunogen preparation is monitored by taking test bleeds and determining the titer of 
reactivity to the protein of interest. When appropriately high titers of antibody to the 

5 immunogen arc obtained, blood is collected from the animal and antisera are prepared. 
Further fractionation of the antisera to enrich for antibodies reactive to the protein is 
performed where desired (See, e.g., Coligan, Current Protocols in Immunology, 
Wiley/Greene, NY (1991); and Harlow and Lane, Antibodies: A Laboratory Manual, 
Cold Spring Harbor Press, NY (1989)). 

0 Antibodies, including binding fragments and single chain recombinant 

versions thereof, against predetermined fragments of a protein of the present invention 
are raised by immunizing animals, e.g., with conjugates of the fragments with carrier 
proteins as described above. Typically, the immunogen of interest is a protein of at 
least about 5 amino acids, more typically the protein is 10 amino acids in length, 

5 preferably, 15 amino acids in length and more preferably the protein is 20 amino acids 
in length or greater. The peptides are typically coupled to a carrier protein (e.g., as a 
fusion protein), or are recombinantly expressed in an immunization vector. Antigenic 
determinants on peptides to which antibodies bind are typically 3 to 10 amino acids in 
length. 

0 Monoclonal antibodies are prepared from cells secreting the desired 

antibody. Monoclones antibodies are screened for binding to a protein from which the 
immunogen was derived. Specific monoclonal and polyclonal antibodies will usually 
have an antibody binding site with an affinity constant for its cognate monovalent 
antigen at least between lOMO 7 , usually at least 10 s , preferably at least 10 9 , more 

5 preferably at least 10 10 , and most preferably at least 10" liters/mole. 

In some instances, it is desirable to prepare monoclonal antibodies from 
various mammalian hosts, such as mice, rodents, primates, humans, etc. Description 
of techniques for preparing such monoclonal antibodies are found in, e.g., Basic and 
Clinical Immunology, 4th ed., Stites et al. y Eds., Lange Medical Publications, Los 

0 Altos, CA, and references cited therein; Harlow and Lane, Supra; Goding, Monoclonal 
Antibodies: Principles and Practice, 2nd ed., Academic Press, New York, NY (1986); 
and Kohler and Milstein, Nature 256: 495-497 (1975). Summarized briefly, this 
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method proceeds by injecting an animal with an immunogen comprising a protein of the 
present invention. The animal is then sacrificed and cells taken from its spleen, which 
are fused with myeloma cells. The result is a hybrid cell or "hybridoma" that is 
capable of reproducing in vitro. The population of hybridomas is then screened to 
5 isolate individual clones, each of which secrete a single antibody species to the 
immunogen. In this manner, the individual antibody species obtained are the products 
of immortalized and cloned single B cells from the immune animal generated in 
response to a specific site recognized on the immunogenic substance. 

Other suitable techniques involve selection of libraries of recombinant 

10 antibodies in phage or similar vectors (see, e.g., Huse et al. 9 Science 246: 1275-1281 
(1989); and Ward, et aL, Nature 341: 544-546 (1989); and Vaughan et aL, Nature 
Biotechnology, 14: 309-314 (1996)). Alternatively, high avidity human monoclonal 
antibodies can be obtained from transgenic mice comprising fragments of the 
^rearranged human heavy and light chain Ig loci (i.e., minilocus transgenic mice). 

15 Fishwild et al., Nature Biotech., 14: 845-851 (1996). Also, recombinant 
immunoglobulins may be produced. See, Cabilly, U.S. Patent No. 4,816,567; and 
Queen etal., Proc. Nat'lAcad. Sci. 86: 10029-10033 (1989). 

The antibodies of this invention are also used for affinity 
chromatography in isolating proteins of the present invention. Columns are prepared, 

20 e.g., with the antibodies linked to a solid support, e.g., particles, such as agarose, 
Sephadex, or the like, where a cell lysate is passed through the column, washed, and 
treated with increasing concentrations of a mild denaturant, whereby purified protein 
are released. 

The antibodies can be used to screen expression libraries for particular 
25 expression products such as normal or abnormal protein. Usually the antibodies in such 
a procedure are labeled with a moiety allowing easy detection of presence of antigen by 
antibody binding. 

Antibodies raised against a protein of the present invention can also be 
used to raise anti-idiotypic antibodies. These are useful for detecting or diagnosing 
30 various pathological conditions related to the presence of the respective antigens. 

Frequently, the proteins and antibodies of the present invention will be 
labeled by joining, either covalently or non-covalently, a substance which provides for 
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a detectable signal. A wide variety of labels and conjugation techniques are known and 
are reported extensively in both the scientific and patent literature. Suitable labels 
include radionucleotides, enzymes, substrates, cofactors, inhibitors, fluorescent 
moieties, chemiluminescent moieties, magnetic particles, and the like. 

5 

Protein IfflnffiUMagttS 

Means of detecting the proteins of the present invention are not critical 
aspects of the present invention. In a preferred embodiment, the proteins are detected 
and/or quantified using any of a number of well recognized immunological binding 
assays {see, e.g., U.S. Patents 4,366,241; 4,376,110; 4,517,288; and 4,837,168). For 
a review of the general immunoassays, see also Methods in Cell Biology, Vol. 37: 
Antibodies in Cell Biology, Asai, Ed., Academic Press, Inc. New York (1993); Basic 
and Clinical Immunology 7th Edition, Stites & Terr, Eds. (1991). Moreover, the 
immunoassays of the present invention can be performed in any of several 
configurations, e.g., those reviewed in Enzyme Immunoassay, Maggio, Ed., CRC 
Press, Boca Raton, Florida (1980); Tijan, Practice and Theory of Enzyme 
Immunoassays, Laboratory Techniques in Biochemistry and Molecular Biology, 
Elsevier Science Publishers B.V., Amsterdam (1985); Harlow and Lane, supra\ 
Immunoassay: A Practical Guide, Chan, Ed., Academic Press, Orlando, FL (1987); 
Principles and Practice of Immunoassay sm, Price and Newman Eds., Stockton Press, 
NY (1991); and Non-isotopic Immunoassays, Ngo, Ed., Plenum Press, NY (1988). 
Immunological binding assays (or immunoassays) typically utilize a "capture agent" to 
specifically bind to and often immobilize the analyte (in this case, a protein of the 
present invention). The capture agent is a moiety that specifically binds to the analyte. 
In a preferred embodiment, the capture agent is an antibody that specifically binds a 
protein(s) of the present invention. The antibody may be produced by any of a number 
of means known to those of skill in the art as described herein. 

Immunoassays also often utilize a labeling agent to specifically bind to 
and label the binding complex formed by the capture agent and the analyte. The 
labeling agent may itself be one of the moieties comprising the antibody/analyte 
complex. Thus, the labeling agent may be a labeled protein of the present invention or 
a labeled antibody specifically reactive to a protein of the present invention. 
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Alternatively, the labeling agent may be a third moiety, such as another antibody, that 
specifically binds to the antibody/protein complex. 

In a preferred embodiment, the labeling agent is a second antibody 
bearing a label. Alternatively, the second antibody may lack a label, but it may, in 
turn, be bound by a labeled third antibody specific to antibodies of the species from 
which the second antibody is derived. The second can be modified with a detectable 
moiety, such as biotin, to which a third labeled molecule can specifically bind, such as 
enzyme-labeled streptavidin. 

Other proteins capable of specifically binding immunoglobulin constant 
regions, such as protein A or protein G may also be used as the label agent. These 
proteins are normal constituents of the cell walls of streptococcal bacteria. They exhibit 
a strong non-immunogenic reactivity with immunoglobulin constant regions from a 
variety of species (See, generally Kronval, et a/., 7. Immunol 111: 1401-1406 (1973), 
and Akerstrom, etal., J. Immunol 135: 2589-2542 (1985)). 

Throughout the assays, incubation and/or washing steps may be required 
after each combination of reagents. Incubation steps can vary from about 5 seconds to 
several hours, preferably from about 5 minutes to about 24 hours. However, the 
incubation time will depend upon the assay format, analyte, volume of solution, 
concentrations, and the like. Usually, the assays will be carried out at ambient 
temperature, although they can be conducted over a range of temperatures, such as 
10°C to 40°C. 

While the details of the immunoassays of the present invention may vary 
with the particular format employed, the method of detecting a protein of the present 
invention in a biological sample generally comprises the steps of contacting the 
biological sample with an antibody which specifically reacts, under immunologically 
reactive conditions, to a protein of the present invention. The antibody is allowed to 
bind to the protein under immunologically reactive conditions, and the presence of the 
bound antibody is detected directly or indirectly. 

A. Non-Competitive Assay Formats 

Immunoassays for detecting proteins of the present invention include 
competitive and noncompetitive formats. Noncompetitive immunoassays are assays in 
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which the amount of captured analyte (i.e., a protein of the present invention) is 
directly measured. In one preferred "sandwich" assay, for example, the capture agent 
(e.g., an antibody specifically reactive, under immunoreactive conditions, to a protein 
of the present invention) can be bound directly to a solid substrate where they are 
5 immobilized. These immobilized antibodies then capture the protein present in the test 
sample. The protein thus immobilized is then bound by a labeling agent, such as a 
second antibody bearing a label. Alternatively, the second antibody may lack a label, 
but it may, in turn, be bound by a labeled third antibody specific to antibodies of the 
species from which the second antibody is derived. The second can be modified with a 
10 detectable moiety, such as biotin, to which a third labeled molecule can specifically 
bind, such as enzyme-labeled streptavidin. 

B. Competitive Assay Formats 

In competitive assays, the amount of analyte present in the sample is 
15 measured indirectly by measuring the amount of an added (exogenous) analyte (e.g., a 

protein of the present invention) displaced (or competed away) from a capture agent 

(e.g., an antibody specifically reactive, under immunoreactive conditions, to the 

protein) by the analyte present in the sample. In one competitive assay, a known 

amount of analyte is added to the sample and the sample is then contacted with a 
20 capture agent that specifically binds a protein of the present invention. The amount of 

protein bound to the capture agent is inversely proportional to the concentration of 

analyte present in the sample. 

In a particularly preferred embodiment, the antibody is immobilized on a 

solid substrate. The amount of protein bound to the antibody may be determined either 
25 by measuring the amount of protein present in a protein/antibody complex, or 

alternatively by measuring the amount of remaining uncomplexed protein. The amount 

of protein may be detected by providing a labeled protein. 

A hapten inhibition assay is another preferred competitive assay. In this 

assay a known analyte, (such as a protein of the present invention) is immobilized on a 
30 solid substrate. A known amount of antibody specifically reactive, under 

immunoreactive conditions, to the protein is added to the sample, and the sample is 

then contacted with the immobilized protein. In this case, the amount of antibody 
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bound to the immobilized protein is inversely proportional to the amount of protein 
present in the sample. Again, the amount of immobilized antibody may be detected by 
detecting either the immobilized fraction of antibody or the fraction of the antibody that 
remains in solution. Detection may be direct where the antibody is labeled or indirect 
S by the subsequent addition of a labeled moiety that specifically binds to the antibody as 
described above. 

C. Generation of pooled antiserafor use in immunoassays 

A protein that specifically binds to or that is specifically immunoreactive 

0 with an antibody generated against a defined immunogen, such as an immunogen 
consisting of the amino acid sequence of SEQ ID NOS: 1-18 and 73-75, is determined 
in an immunoassay. The immunoassay uses a polyclonal antiserum which is raised to a 
polypeptide of the present invention (i.e., the immunogenic polypeptide). This 
antiserum is selected to have low crossreactivity against other proteins and any such 

5 crossreactivity is removed by immunoabsorbtion prior to use in the immunoassay (e.g., 
by immunosoibtion of the an ti sera with a protein of different substrate specificity (e.g., 
a different enzyme) and/or a protein with the same substrate specificity but of a 
different form). 

In order to produce an ti sera for use in an immunoassay, a polypeptide 
0 (e.g., SEQ ID NOS: 1-18 and 73-75) is isolated as described herein. For example, 
recombinant protein can be produced in a mammalian or other eukaryotic cell line. An 
inbred strain of mice is immunized with the protein of using a standard adjuvant, such 
as Freund's adjuvant, and a standard mouse immunization protocol (see Harlow and 
Lane, supra). Alternatively, a synthetic polypeptide derived from the sequences 
5 disclosed herein and conjugated to a carrier protein is used as an immunogen. 
Polyclonal sera are collected and titered against the immunogenic polypeptide in an 
immunoassay, for example, a solid phase immunoassay with the immunogen 
immobilized on a solid support. Polyclonal antisera with a titer of 10 4 or greater are 
selected and tested for their cross reactivity against polypeptides of different forms or 
\0 substrate specificity, using a competitive binding immunoassay such as the one 
described in Harlow and Lane, supra, at pages 570-573. Preferably, two or more 
distinct forms of polypeptides are used in this determination. These distinct types of 
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polypeptides are used as competitors to identify antibodies which are specifically bound 
by the polypeptide being assayed for. The competitive polypeptides can be produced as 
recombinant proteins and isolated using standard molecular biology and protein 
chemistry techniques as described herein. 
5 Immunoassays in the competitive binding format are used for 

crossreactivity determinations. For example, the immunogenic polypeptide is 
immobilized to a solid support. Proteins added to the assay compete with the binding 
of the antisera to the immobilized antigen. The ability of the above proteins to compete 
with the binding of the antisera to the immobilized protein is compared to the 

10 immunogenic polypeptide. The percent crossreactivity for the above proteins is 
calculated, using standard calculations. Those antisera with less than 10% 
crossreactivity with a distinct form of a polypeptide are selected and pooled. The cross- 
reacting antibodies are then removed from the pooled antisera by immunoabsoibtion 
with a distinct form of a polypeptide. 

15 The immunoabsorbed and pooled antisera are then used in a competitive 

binding immunoassay as described herein to compare a second "target" polypeptide to 
the immunogenic polypeptide. In order to make this comparison, the two polypeptides 
are each assayed at a wide range of concentrations and the amount of each polypeptide 
required to inhibit 50% of the binding of the antisera to the immobilized protein is 

20 determined using standard techniques. If the amount of the target polypeptide required 
is less than twice the amount of the immunogenic polypeptide that is required, then the 
target polypeptide is said to specifically bind to an antibody generated to the 
immunogenic protein. As a final determination of specificity, the pooled antisera is 
fully immunosoibed with the immunogenic polypeptide until no binding to the 

25 polypeptide used in the immunosorbtion is detectable. The fully immunosorbed antisera 
is then tested for reactivity with the test polypeptide. If no reactivity is observed, then 
the test polypeptide is specifically bound by the antisera elicited by the immunogenic 
protein. 

30 D. Other Assay Formats 

In a particularly preferred embodiment, Western blot (immunoblot) 
analysis is used to detect and quantify the presence of protein of the present invention in 
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the sample. The technique generally comprises separating sample proteins by gel 
electrophoresis on the basis of molecular weight, transferring the separated proteins to a 
suitable solid support, (such as a nitrocellulose filter, a nylon filter, or derivatized 
nylon filter), and incubating the sample with the antibodies that specifically bind a 
5 protein of the present invention. The antibodies specifically bind to the protein on the 
solid support. These antibodies may be directly labeled or alternatively may be 
subsequently detected using labeled antibodies (e.g., labeled sheep anti-mouse 
antibodies) that specifically bind to the antibodies. 

10 E. Quantification of Proteins. 

The proteins of the present invention may be detected and quantified by 
any of a number of means well known to those of skill in the art. These include 
analytic biochemical methods such as electrophoresis, capillary electrophoresis, high 
performance liquid chromatography (HPLC), thin layer chromatography (TLC), 

15 hyperdiffusion chromatography, and the like, and various immunological methods such 
as fluid or gel precipitin reactions, immunodiffusion (single or double), 
Immunoelectrophoresis, radioimmunoassays (RIAs), enzyme-linked immunosorbent 
assays (ELJSAs), immunofluorescent assays, and the like, 

20 F. Reduction of Non-specific Binding 

One of skill will appreciate that it is often desirable to reduce non- 
specific binding in immunoassays and during analyte purification. Where the assay 
involves an antigen, antibody, or other capture agent immobilized on a solid substrate, 
it is desirable to minimize the amount of non-specific binding to the substrate. Means 

25 of reducing such non-specific binding are well known to those of skill in the art. 
Typically, this involves coating the substrate with a proteinaceous composition. Li 
particular, protein compositions such as bovine serum albumin (BSA), nonfat powdered 
milk, and gelatin are widely used. 

30 G. Immunoassay Labels 

The labeling agent can be, e.g., a monoclonal antibody, a polyclonal 
antibody, a binding protein or complex, or a polymer such as an affinity matrix, 
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carbohydrate or lipid. Detectable labels suitable for use in the present invention include 
any composition detectable by spectroscopic, radioisotopic, photochemical, 
biochemical, immunochemical, electrical, optical or chemical means. Detection may 
proceed by any known method, such as immunoblotting, western analysis, gel-mobility 

5 shift assays, fluorescent in situ hybridization analysis (FISH), tracking of radioactive or 
bioluminescent markers, nuclear magnetic resonance, electron paramagnetic resonance, 
stopped-flow spectroscopy, column chromatography, capillary electrophoresis, or other 
methods which track a molecule based upon an alteration in size and/or charge. The 
particular label or detectable group used in the assay is not a critical aspect of the 

10 invention. The detectable group can be any material having a detectable physical or 
chemical property. Such detectable labels have been well-developed in the field of 
immunoassays and, in general, any label useful in such methods can be applied to the 
present invention. Thus, a label is any composition detectable by spectroscopic, 
photochemical, biochemical, immunochemical, electrical, optical or chemical means. 

15 Useful labels in the present invention include magnetic beads, fluorescent dyes, 
radiolabels, enzymes, and colorimetric labels or colored glass or plastic beads, as 
discussed for nucleic acid labels, supra. 

The label may be coupled directly or indirectly to the desired component 
of the assay according to methods well known in the art. As indicated above, a wide 

20 variety of labels may be used, with the choice of label depending on the sensitivity 
required, ease of conjugation of the compound, stability requirements, available 
instrumentation, and disposal provisions. 

Non-radioactive labels are often attached by indirect means. Generally, 
a ligand molecule (e.g., biotin) is covalently bound to the molecule. The ligand then 

25 binds to an anti-ligand (e.g., streptavidin) molecule which is either inherently detectable 
or covalently bound to a signal system, such as a detectable enzyme, a fluorescent 
compound, or a chemiluminescent compound. A number of ligands and anti-ligands 
can be used. Where a ligand has a natural anti-ligand, for example, biotin, thyroxine, 
and Cortisol, it can be used in conjunction with the labeled, naturally occurring anti- 

30 ligands. Alternatively, any haptenic or antigenic compound can be used in combination 
with an antibody. 

The molecules can also be conjugated directly to signal generating 
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compounds, e.g. , by conjugation with an enzyme or fluorophore. Enzymes of interest 
as labels will primarily be hydrolases, particularly phosphatases, esterases and 
glycosidases, or oxidoreductases, particularly peroxidases. Fluorescent compounds 
include fluorescein and its derivatives, rhodamine and its derivatives, dansyl, 
5 umbelliferone, etc. Chemiluminescent compounds include luciferin, and 2,3- 
dihydrophthalazinediones, e.g., luminol. For a review of various labeling or signal 
producing systems which may be used, see, U.S. Patent No. 4,391,904, which is 
incorporated herein by reference. 

Means of detecting labels are well known to those of skill in the art. 

10 Thus, for example, where the label is a radioactive label, means for detection include a 
scintillation counter or photographic film as in autoradiography. Where the label is a 
fluorescent label, it may be detected by exciting the fluorochrome with the appropriate 
wavelength of light and detecting the resulting fluorescence, e.g., by microscopy, 
visual inspection, via photographic film, by the use of electronic detectors such as 

15 charge coupled devices (CCDs) or photomultipliers and the like. Similarly, enzymatic 
labels may be detected by providing appropriate substrates for the enzyme and detecting 
the resulting reaction product. Finally, simple colorimetric labels may be detected 
simply by observing the color associated with the label. Thus, in various dipstick 
assays, conjugated gold often appears pink, while various conjugated beads appear the 

20 color of the bead. 

Some assay formats do not require the use of labeled components. For 
instance, agglutination assays can be used to detect the presence of the target antibodies. 
In this case, antigen-coated particles are agglutinated by samples comprising the target 
antibodies. In this format, none of the components need be labeled and the presence of 

25 the target antibody is detected by simple visual inspection. 



ASSAYS for Compounds that Modulate Enzvmatic Activity o r Expression 

The present invention also provides means for identifying compounds 
that bind to (e.g., substrates), and/or increase or decrease (i.e., modulate) the 
30 enzymatic activity of, catalytically active polypeptides of the present invention. The 
method comprises contacting a polypeptide of the present invention with a compound 
whose ability to bind to or modulate enzyme activity is to be determined. The 
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polypeptide employed will have at least 20%, preferably at least 30% or 40%, more 
preferably at least 50% or 60%, and most preferably at least 70% or 80% of the 
specific activity of the native, full-length lignin biosynthesis polypeptide (e.g., 
enzyme). Generally, the polypeptide will be present in a range sufficient to determine 
5 the effect of the compound, typically about 1 nM to 10 jiM. Likewise, the compound 
will be present in a concentration of from about 1 nM to 10 |iM. Those of skill will 
understand that such factors as enzyme concentration, ligand concentrations (i.e., 
substrates, products, inhibitors, activators), pH, ionic strength, and temperature will be 
controlled so as to obtain useful kinetic data and determine the presence of absence of a 

10 compound that binds or modulates polypeptide activity. Methods of measuring enzyme 
kinetics is well known in the art. See, e.g., Segel, Biochemical Calculations, 2 nd ed., 
John Wiley and Sons, New York (1976). 

Although the present invention has been described in some detail by way 
of illustration and example for purposes of clarity of understanding, it will be obvious 

15 that certain changes and modifications may be practiced within the scope of the 
appended claims. 

This example describes the construction cDNA libraries. 
Total RNA Isolation 

20 Total RNA was isolated from corn tissues with TRIzol Reagent (Life 

Technology Inc. Gaithersburg, MD) using a modification of the guanidine 
isothiocyanate/acid-phenol procedure described by Chomczynski and Sacchi 
(Chomczynski, P., and Sacchi, N. Anal. Biochem. 162, 156 (1987)). In brief, plant 
tissue samples were pulverized in liquid nitrogen before the addition of the TRIzol 

25 Reagent, and then were further homogenized with a mortar and pestle. Addition of 
chloroform followed by centrifugation was conducted for separation of an aqueous 
phase and an organic phase. The total RNA was recovered by precipitation with 
isopropyl alcohol from the aqueous phase. 

30 Poly(A)+ RNA Isolation 

The selection of poly (A) + RNA from total RNA was performed using 
PolyATact system (Promega Corporation. Madison, WI). In brief, biotinylated 
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oligo(dT) primers were used to hybridize to the 3* poly(A) tails on mRNA. The 
hybrids were captured using streptavidin coupled to paramagnetic particles and a 
magnetic separation stand. The mRNA was washed at high stringent condition and 
eluted by RNase-free deionized water. 

5 

cDNA library Construction 

cDNA synthesis was performed and unidirectional cDNA libraries were 
constructed using the Superscript Plasmid System (Life Technology Inc. Gaithersburg, 
MD). The first stand of cDNA was synthesized by priming an oligo(dT) primer 

10 containing a Not I site. The reaction was catalyzed by Superscript Reverse 
Transcriptase n at 45°C. The second strand of cDNA was labeled with alpha- 32 P- 
dCTP and a portion of the reaction was analyzed by agarose gel electrophoresis to 
determine cDNA sizes. cDNA molecules smaller than 500 base pairs and unligated 
adaptors were removed by Sephacryl-S400 chromatography. The selected cDNA 

15 molecules were ligated into pSPORTl vector in between of Not I and Sal I sites. 

fig a^pUg Z 

This example describes cDNA sequencing and library subtraction. 

20 Sequencing Template Preparation 

Individual colonies were picked and DNA was prepared either by PCR 
with M13 forward primers and M13 reverse primers, or by plasmid isolation. All the 
cDNA clones were sequenced using M13 reverse primers. 

25 Q-bot Subtraction Procedure 

cDNA libraries subjected to the subtraction procedure were plated out on 
22 x 22 cm 2 agar plate at density of about 3,000 colonies per plate. The plates were 
incubated in a 37°C incubator for 12-24 hours. Colonies were picked into 384-well 
plates by a robot colony picker, Q-bot (GENETTX Limited). These plates were 
30 incubated overnight at 37°C. 

Once sufficient colonies were picked, they were pinned onto 22 x 22 cm 2 
nylon membranes using Q-bot. Each membrane contained 9,216 colonies or 36,864 
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colonies. These membranes were placed onto agar plate with appropriate antibiotic. 
The plates were incubated at 37°C for overnight. 

After colonies were recovered on the second day, these filters were 
placed on filter paper prewetted with denaturing solution for four minutes, then were 

5 incubated on top of a boiling water bath for additional four minutes. The filters were 
then placed on filter paper prewetted with neutralizing solution for four minutes. After 
excess solution was removed by placing the filters on dry filter papers for one minute, 
the colony side of the filters were place into Proteinase K solution, incubated at 37°C 
for 40-50 minutes. The filters were placed on dry filter papers to dry overnight. DNA 

0 was then cross-linked to nylon membrane by UV light treatment. 

Colony hybridization was conducted as described by Sambrook,J., 
Fritsch, E.F. and Maniatis, T., (in Molecular Cloning: A laboratory Manual, 2 nd 
Edition). The following probes were used in colony hybridization: 

1. First strand cDNA from the same tissue as the library was made from to remove the 
S most redundant clones. 

2. 48-192 most redundant cDNA clones from the same library based on previous 
sequencing data. 

3. 192 most redundant cDNA clones from previous sequencing in corn. 

4. A Sal-A20 oligo nucleotide: TCG ACC CAC GCG TCC GAA AAA AAA AAA 
0 AAA AAA AAA, removes clones containing a poly A tail but no cDNA. 

5. cDNA clones derived from rRNA. 

The image of the autoradiography was scanned into computer and the signal intensity 
and cold colony addresses of each colony was analyzed. Rearraying of cold-colonies 
from 384 well plates to 96 well plates was conducted using Q-bot. 

5 

This example describes the tissue and tissue treatment used for construction 
of cDNA libraries. 

The polynucleotide having the DNA sequences given in SEQ ID 
K) NOS: 19-36 were obtained from the sequencing of a library of cDNA clones prepared 
from maize. The library from which SEQ ID NO: 19 was obtained was constructed 
from premeiotic to uninucleate tassel from line A632. The library from which SEQ ID 
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10 



15 



20 



25 



NO: 20 was obtained was constructed from a shoot culture from the maize line 
Crusader. The library from which SEQ ID NO: 21 was obtained was constructed from 
immature ear of line AP9. The library from which SEQ ID NO:22 was obtained was 
constructed from tissue culture during induced apoptois of line BMS-P2#10. The library 
from which SEQ ID NO: 23 was obtained was constructed from premeiotic to 
uninucleate tassel from line A632. The library from which SEQ ID NO:24 was 
obtained was constructed from early meiotic tassel (16-18 mm). The library from which 
SEQ ID NO:25 was obtained was constructed from corn root worm infested root roots 
of line B73. The library from which SEQ ID NO:26 was obtained was constructed 
from immature ear of line AP9. The library from which SEQ ID NO:27 was obtained 
was constructed from scutelar node of germinating maize seeds of line B73. The 
library from which SEQ ID NO:28 was obtained was constructed from B73 embryo 13 
days after pollination. The library from which SEQ ID NO:29 was obtained was 
constructed from 8-hour heat shock recovery B73 seedling. The library from which 
SEQ ID NO: 30 was obtained was constructed from corn root worm infested root roots 
of line B73. The library from which SEQ ID NO: 31 was obtained was constructed from 
shoot culture of line CM45. The library from which SEQ ID NO:32 was obtained was 
constructed from 8-hour heat shock recovery B73 seedling. The library from which 
SEQ ID NO: 33 was obtained was constructed from root tips (less than 5mm in length) 
of B73. The library from which SEQ ID NO: 34 was obtained was constructed from 
green leaves of B73 treated with jasmonic acid. The library from which SEQ ID NO:35 
was obtained was constructed from green leaves of B73. The library from which SEQ 
ID NO:36 was obtained was constructed from immature ear of inbred B73. The library 
from which SEQ ID NO: 76 was obtained was constructed from ear leaf collar tissue 
after pollen shed from inbred B73. The library from which SEQ ID NO:77 was 
obtained was constructed from leaf collars for the ear leaf of inbred B73. The library 
was subject to a subtraction procedure as described in Example 2. The library from 
which SEQ ID NO:78 was obtained was constructed from a 7 cm. section of the whorl 
from B73 that had been previously infected with European corn borer (1" brood) at the 
V9 (nine node stage, vegetative growth) stage of development. 

The above examples are provided to illustrate the invention but not to 
limit its scope. Other variants of the invention will be readily apparent to one of 
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ordinary skill in the art and are encompassed by the appended claims. All publications, 
patents, and patent applications cited herein are hereby incorporated by reference. 
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1. An isolated nucleic acid comprising a member selected from the 
group consisting of: 

5 (a) a first polynucleotide having at least 60% identity to a second 

polynucleotide encoding a polypeptide selected from the group consisting of SEQ ID 
NOS: 1-18 and 73-75, wherein said first polynucleotide encodes a polypeptide which 
when presented as an immunogen elicits the production of an antibody which is 
specifically reactive to said second polypeptide; 

10 (b) a polynucleotide which is complementary to said first polynucleotide 

of (a); and 

(c) a polynucleotide comprising at least 25 contiguous nucleotides from a 
first polynucleotide of (a) or a polynucleotide of (b). 

15 2. The isolated nucleic acid of claim 1, wherein said polynucleotide 

has a sequence selected from the group consisting of SEQ ID NOS: 19-36 and 76-78. 

3. A recombinant expression cassette, comprising a nucleic acid of 
claim 1 operably linked to a promoter. 

20 

4. The recombinant expression cassette of claim 3, wherein said 
nucleic acid is operably linked in antisense orientation to said promoter. 

5. A host cell introduced with the recombinant expression cassette of 

25 claim 3. 

6. The host cell of claim 5, wherein said host cell is a sorghum 
(Sorghum bicolof) or maize (Zea mays) cell. 

30 7. The isolated nucleic acid of claim 1, wherein the polynucleotide 

is DNA. 
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8. An isolated protein comprising a polypeptide of at least 10 
contiguous amino acids encoded by the isolated nucleic acid of claim 2. 

9. The protein of claim 8, wherein said polypeptide has a sequence 
5 selected from the group consisting of SEQ ID NOS: 1-18 and 73-75. 

10. An isolated nucleic acid comprising a polynucleotide of at least 
25 nucleotides in length which selectively hybridizes under stringent conditions to a 
nucleic acid selected from the group consisting of SEQ ID NOS: 19-36 and 76-78, or a 

10 complement thereof. 

11. The isolated nucleic acid of claim 10 operably linked to a 

promoter. 

15 12. An isolated nucleic acid comprising a polynucleotide, said 

polynucleotide having at least 80% sequence identity to an identical length of selected 
from the group consisting of SEQ ID NOS: 19-36 and 76-78 or a complement thereof. 

13. An isolated nucleic acid comprising a polynucleotide having a 
20 sequence of a nucleic acid amplified from a Zea mays nucleic acid library using the 
primers selected from the group consisting of: 37-72 and 79-84 or complements 



14. The isolated nucleic acid of claim 13, wherein said nucleic acid 
25 library is a cDNA library. 

15. A recombinant expression cassette comprising a nucleic acid of 
claim 13 operably linked to a promoter. 



thereof. 



30 16. A host cell comprising the recombinant expression cassette of 

claim 15. 
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17. A protein produced from the host cell of claim 16 by expressing 
said protein encoded by said nucleic acid. 



18. An isolated nucleic acid comprising a polynucleotide encoding a 
5 polypeptide wherein: 

(a) said polypeptide comprises at least 10 contiguous amino acid residues 
from a first polypeptide selected from the group consisting of SEQ ID NOS: 1-18 and 
73-75, and wherein said polypeptide, when presented as an immunogen, elicits the 
production of an antibody which specifically binds to said first polypeptide; 
10 (b) said polypeptide does not bind to an ti sera raised against said first 

polypeptide which has been fully immunosorbed with said first polypeptide; 

(c) said polypeptide has a molecular weight in non-glycosylated form 
within 10% of said first polypeptide. 

15 19. A heterologous promoter operably linked to a non-isolated lignin 

biosynthesis polynucleotide encoding a polypeptide encoded by the nucleic acid of claim 
13. 

20. A transgenic plant comprising a recombinant expression cassette 
20 comprising a plant promoter operably linked to an isolated nucleic acid of claim 1. 

21. The transgenic plant of claim 20, wherein said plant is Zea mays. 

22. A transgenic seed from the transgenic plant of claim 20. 

25 

23. The transgenic seed of claim 22, wherein the seed is from Zea 

mays. 



24. A method of modulating lignin biosynthesis in a plant, 

30 comprising: 

(a) transforming a plant cell with a recombinant expression 
cassette comprising a lignin biosynthesis polynucleotide operably linked to a promoter; 
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(b) growing the plant cell under plant growing conditions; and 

(c) inducing expression of said polynucleotide for a time 
sufficient to modulate lignin biosynthesis in said plant. 

5 25. The method of claim 24, wherein the plant is maize. 

26. The method of claim 24, wherein lignin biosynthesis is increased. 
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SEQUENCE LISTING 

<110> Helent jaris, Timothy G. 
Bowen, Benjamin A. 
Wang , Xun 

<120> Genes Encoding Enzymes for Lignin 
Biosynthesis and Uses Thereof 



<130> 0709 

<150> 60/057,082 
<151> 1997-08-27 

<150> 09/076,851 
<151> 1998-05-12 

<160> 84 

<170> FastSEQ for Windows Version 3.0 

<210> 1 

<211> 559 

<212> PRT 

<213> Zea mays 





<400> 


l 


























Met 


Gly Asp Ala 


Ala 


He 


Ala 


Ala 


Val 


His 


Leu 


His 


GlU 


Ser 


Glu 


Glu 


1 






5 










10 










15 




GlU 


His He 


Phe 


Arg 


Ser 


Arg 


Phe 


Pro 


Pro 


Val 


Ala 


Val 


Pro 


Asp 


Asp 






20 










25 










30 






val 


Thr Val 


Pro 


Glu 


Phe 


Val 


Leu 


Ala 


Asp 


Ala 


Glu 


Ala 


Ty r 


Ala 


Asp 




35 










40 










45 








Lys 


Thr Ala 


Leu 


Val 


Glu 


Ala 


Ala 


Pro 


Gly 


Gly 


Arg 


Ser 


Tyr 


Thr 


Tyr 




50 








55 










60 










Gly 


Glu Leu 


Val 


Arg 


Asp 


Val 


Ala 


Arg 


Phe 


Ala 


Arg 


Ala 


Leu 


Arg 


Ser 


65 








70 










75 










80 


He 


Gly Val 


Arg 


Arg 


Gly 


His 


Val 


Val 


Val 


Val 


Ala 


Leu 


Pro 


Asn 


Leu 






85 










90 










95 




Ala 


Val Tyr 


Pro 


Val 


Val 


Ser 


Leu 


Gly 


He 


Met 


Ser 


Ala 


Gly 


Ala 


Val 




100 










105 










110 






Phe 


Ser Gly 


Val 


Asn 


Pro 


Arg 


Ala 


Val 


Ala 


Ala 


Glu 


He 


Lys 


Lys 


Gin 




115 










120 










125 








Val 


Glu Asp 


Ser 


GlU 


Ala 


Arg 


Leu 


Val 


Val 


Ala 


Asp 


Ala 


Val 


Ala 


Tyr 




130 








135 










140 










Asp 


Lys Val 


Lys 


Asp 


Ala 


Gly 


Val 


Pro 


Val 


He 


Gly 


He 


Gly 


Asp 


Val 


145 








150 










155 










160 


Ala 


Arg Leu 


Pro 


Gly 


Ala 


He 


Gly 


Trp 


Asp 


Glu 


Leu 


Leu 


Al* Met Ala 



165 170 175 
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Asp Arg Ala Gly Ala Pro Val Val Ala Leu Glu Pro Ala Gin Gin Ser 

180 185 190 

Asp Leu Cys Ala Leu Pro Tyr Ser Ser Gly Thr Thr Gly Val Ser Lys 

195 200 " 205 

Gly Val Met Leu Ser His Arg Asn Leu Val Ser Ser Leu Cys Ser Ser 

210 215 220 

Met Phe Ala Val Gly Gin Glu Leu Val Gly Gin Val Val Thr Leu Gly 
225 230 235 240 

Leu Met Pro Phe Phe His lie Tyr Gly lie Thr Gly lie Cys Cys Ala 

245 250 ^ 255 

Thr Leu Arg His Lys Gly Thr Val Val Val Met Asp Arg Phe Asp Leu 

260 265 270 

Arg Ala Phe Leu Gly Ala Leu Leu Thr His Arg Val Met Phe Ala Pro 

275 280 285 

Val Val Pro Pro Val Met Leu Ala Met Val Lys Ser Pro Val Ala Asp 

290 295 300 

Glu Phe Asp Leu Ser Gly Leu Ala Leu Arg Ser Val Met Thr Ala Ala 
305 310 315 320 

Ala Pro Leu Ala Pro Asp Leu Leu Ala Ala Phe Glu Arg Lys Phe Pro 

325 330 335 

Gly Val Gin Val Glu Glu Ala Tyr Gly Leu Thr Glu His Ser Cys lie 

340 345 350 

Thr Leu Thr His Ala Ser Gly Gly Gly Glu Asp Val Gly Ser Ala Val 

355 360 365 

Gin Val Ala Lys Lys Lys Ser Val Gly Phe lie Leu Pro Asn Leu Glu 

370 375 380 

Val Lys Phe val Asp Pro Asp Thr Gly Arg Ser Leu Pro Lys Asn Thr 
385 390 395 400 

Pro Gly Glu lie Cys Val Arg Ser Gin Ala Val Met Gin Gly Tyr Tyr 

405 410 415 

Arg Lys Lys Glu Glu Thr Glu Arg Thr lie Asp Ala Ala Gly Trp Leu 

420 425 430 

His Thr Gly Asp Val Gly Tyr lie Asp Asp Asp Gly Asp Val Phe lie 

435 440 445 

Val Asp Arg lie Lys Glu Leu lie Lys Tyr Lys Gly Phe Gin Val Ala 

450 455 460 

Pro Ala Glu Leu Glu Ala lie Leu Leu Ser His Pro Ser Val Glu Asp 
465 470 475 480 

Ala Ala Val Phe Gly Leu Pro Asp Glu Glu Ala Gly Glu Val Pro Ala 

485 490 495 

Ser Cys Val Val Arg Arg Arg Gly Ala Pro Glu Ser Glu Ala Asp Met 

500 505 510 

Met Ala Tyr Val Ala Gly Arg Val Ala Ser Tyr Lys Lys Leu Arg Leu 

515 520 525 

Leu Arg Phe Val Asp Ala lie Pro Lys Ser Val Ser Gly Lys lie Leu 

530 535 540 

Arg Arg Gin Leu Arg Asp Glu Phe Val Lys Lys Thr Ala Ala Ala 
545 550 555 



<210> 2 
<211> 555 
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<212> 


PRT 




<213> 


Zea 




<400> 




Met 


Gly 


Ser 


Val 


1 








Glu 


Lys 


Ala 


Val 






20 


Asp 


He 


Glu 


He 










Lys 


Met 


Gly 


Glu 




50 






Gly Ala 


Ser 


Tyr 


65 








Ala 


Ser 


Gly 


Leu 


Ser 


Leu 


Leu 


Arg 








100 


Ala 


Arg 


Leu 


Gly 






115 


* 


His 


Glu 


Val 


His 




130 






Thr 


Glu 


Ala 


Cys 


145 








Gly 


He 


Pro 


Val 


Phe 


Ala 


Glu 


Leu 








180 


His 


Pro 


Asp 


Asp 






195 




Leu 


Pro 


Lys 


Gly 




210 






Ala 


Gin 


Gin 


Val 


225 








Asp 


Val 


Val 


Leu 


Ser 


Val 


Leu 


Leu 








260 


Arg 


Lys 


Phe 


Asp 






275 




He 


Thr 


He 


Ala 




290 






ser 


Pro 


Arg 


Val 


305 








ser 


Gly 


Ala 


Ala 


Lys 


He 


Pro 


Asn 








340 


Gly 


Pro 


Val 


Leu 






355 




Val 


Lys 


Ser 


Gly 



mays 



Asp 


Ala 


Ala 


He 


5 








Glu 


Glu 


Lys 


Ala 


Asp 


Ser 


Ser 


Met 








40 


Val 


Ala 


Glu 


Arg 






R c> 
O D 




Thr 


Tyr 


Ala 


Glu 




70 






Arg 


Ala 


Met 


Gly 


85 








Asn 


Cys 


Pro 


Glu 


Ala 


Ala 


Thr 


Thr 








120 


Arg 


Gin 


Ala 


Glu 






-L J — > 




Ala 


Val 


Glu 


Lys 




150 




Val 


Thr 


Val 


Asp 


165 








He 


Ala 


Ala 


Glu 


Val 


Val 


Ala 


Leu 








200 


Val 


Met 


Leu 


Thr 






215 




Asp 


Gly 


Glu 


Asn 




230 






Cys 


Leu 


Leu 


Pro 


245 








Ala 


Gly 


Leu 


Arg 


Leu 


Gly Ala 


Leu 








280 


Pro 


Phe 


Val 


Pro 






295 




Thr 


Ala 


Gly 


Asp 




310 






Pro 


Met 


Gly 


Lys 


325 








Ala 


Val 


Leu 


Gly 


Ala 


Met 


Cys 


Leu 








360 


Ser 


Cys 


Gly 


Thr 



Ala 


Val 


Pro 


Val 




10 






Met 


Val 


Phe 


Arg 


25 






Ala 


Leu 


His 


Thr 


Ala 


Cys 


Leu 


He 








60 


Val 


Glu 


Ser 


Leu 






/5 




Val 


Gly 


Lys 


Gly 




90 






Phe 


Ala 


Phe 


Thr 


105 








Thr 


Ala 


Asn 


Pro 


Ala 


Ala 


Gly 


Ala 








140 


Val 


Arg 


Glu 


Phe 










Gly Arg 


Phe 


Asp 




170 






Glu 


Leu 


Glu 


Ala 


185 








Pro 


Tyr 


Ser 


Ser 


His 


Arg 


Ser 


Leu 








220 


Pro 


Asn 


Leu 


Tyr 










Leu 


Phe 


His 


He 




250 






Ala 


Gly 


Ser 


Thr 


265 








Val 


Asp 


Leu 


Val 


Pro 


He 


Val 


Val 








300 


Leu 


Ala 


Ser 


He 






315 




Glu 


Leu 


Gin 


Asp 




330 






Gin 


Gly 


Tyr 


Gly 


345 








Ala 


Phe 


Ala 


Lys 


Val 


Val 


Arg 


Asn 



Pro 


Ala 


Ala 


Glu 






15 




Ser 


Lys 


Leu 


Pro 




30 






Tyr 


Cys 


Phe 


Glv 


45 








Asp 


Gly 


Leu 


Thr 


Ser 


Arg 


Arg 


Ala 








80 


Asp 


Val 


Val 


Met 






95 




Phe 


Leu 


Gly 


Ala 




110 






Phe 


Tvr 


Thr 


Pro 


125 








Ara 


Leu 


He 


Val 


Ala 


Ala 


Glu 


Ara 








160 


Gly 


Cys 


Val 


Glu 






175 




Asp 


Ala 


Asp 


He 




190 






Gly 


Thr 


Thr 


Gly 


205 








He 


Thr 


Ser 


Val 


Phe 


Arg 


Lys 


Asp 








240 


Tyr 


Ser 


Leu 


Asn 






255 




He 


Val 


He 


Met 




270 






Arg 


Arg 


Tyr 


Val 


285 








Glu 


He 


Ala 


Lys 


Arg 


Met 


Val 


Met 








320 


Ala 


Phe 


Met 


Ala 






335 




Met 


Thr 


Glu 


Ala 




350 






Glu 


Pro 


Tyr 


Pro 


365 








Ala 


Glu 


Leu 


Lys 
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370 










375 




lie 


Val 


Asp 


Pro 


Asp 


Thr 


Gly 


Ala 


385 










390 






Glu 


He 


Cys 


He 


Arg 


Gly 


Glu 


Gin 










405 








Pro 


Glu 


Ser 


Thr 


Lys 


Asn 


Thr 


He 








420 










Gly Asp 


He 


Gly 


Tyr 


Val 


Asp 


Asp 






435 










440 


Ara 


Leu 


Lys 


Glu 


He 


He 


Lys 


Tyr 




450 










455 




Glu 


Leu 


Glu 


Ala 


Leu 


Leu 


He 


Thr 


465 










470 






Val 


Val 


Ser 


Met 


Asn 


Asp 


Asp 


Leu 










485 








lie 


Val 


Arg 


Thr 


Glu 


Gly 


Ser 


Gin 








500 










Phe 


Val 


Ala 


Lys 


Glu 


Val 


Val 


Phe 






515 










520 


Phe 


Thr 


Glu 


Ser 


He 


Pro 


Lys 


Asn 




530 










535 




Asp 


Leu 


Arg 


Ala 


Arg 


Leu 


Ala 


Ala 


545 










550 








<210> 


3 












<211> 


575 












<212> 


PRT 












<213> 


Zea 


mays 








<400> 


3 










Met 


He 


Thr 


Val 


Ala 


Ala 


Pro 


Glu 


1 








5 








Ala 


Ala 


Val 


Ala 


Ala 


Pro 


Glu 


Glu 








20 










Asp 


He 


Asp 


He 


Pro 


Thr 


His 


Leu 






35 










40 


Arg 


Ala 


Ala 


Glu 


Ala 


Ala 


Gly 


Ala 




50 










55 




Gly Arg 


Thr 


Tyr 


Thr 


Tyr 


Ala 


Glu 


65 










70 






Ala 


Ala 


Cys 


Leu 


His 


Gly 


Leu 


Gly 










85 








Leu 


Leu 


Leu 


Gin 


Asn 


Ser 


Val 


Glu 








100 










Ser 


Phe 


Leu 


Gly Ala 


Val 


Thr 


Thr 






115 










120 


Gin 
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20 25 30 

Asp Val Thr lie Lys Val Leu Phe Cys Gly Leu Cys His Thr Asp Leu 

35 40 45 

His Val lie Lys Asn Glu Phe Gly Asn Ala Lys Tyr Pro Val Val Pro 

50 55 60 

Gly His Glu lie Val Gly Val Val Thr Asp Val Gly Ser Gly Val Thr 
65 70 75 80 

Ser Phe Lys Pro Gly Asp Thr Val Gly Val Gly Tyr Phe Val Asp Ser 

85 90 95 

Cys Arg Ser Cys Asp Ser Cys Ser Lys Gly Tyr Glu Ser Tyr Cys Pro 

100 105 110 

Gin Leu Val Glu Thr Ser Asn Gly Val Ser Leu Asp Asp Asp Asp Gly 

115 120 125 

Gly Ala Thr Thr Lys Gly Gly Phe Ser Asp Ala Leu Val Val His Gin 

130 135 140 

Arg Tyr Val Val Arg Val Pro Ala Ser Leu Pro Pro Ala Gly Ala Ala 
145 150 155 160 

Pro Leu Leu Cys Ala Gly Val Thr Val Phe Ser Pro Met Val Gin Tyr 

165 170 175 

Gly Leu Asn Ala Pro Gly Lys His Leu Gly Val Val Gly Leu Gly Gly 

180 185 190 

Leu Gly His Leu Ala Val Arg Phe Gly Lys Ala Phe Gly Met Lys Val 

195 200 205 

Thr Val lie Ser Thr Ser Leu Gly Lys Arg Asp Glu Ala Leu Gly Arg 

210 215 220 

Leu Gly Ala Asp Ala Phe Leu Val Ser Arg Asp Pro Glu Gin Met Arg 
225 230 235 240 

Ala Ala Ala Gly Thr Leu Asp Gly Val lie Asp Thr Val Ser Ala Asp 

245 250 255 

His Pro Val Val Pro Leu Leu Asp Leu Leu Lys Pro Met Gly Gin Met 

260 265 270 

Val Val Val Gly Leu Pro Thr Lys Pro Leu Gin Val Pro Ala Phe Ser 

275 280 285 

Leu Val Ala Gly Gly Lys Arg Val Ala Gly Ser Ala Gly Gly Gly Val 
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290 

Gly Glu Cys Gin 
305 

Ala Asp Val Glu 

Arg Leu Glu Arg 
340 

Gly Ser Lys lie 
355 



295 

Ala Met Leu Asp 
310 

Val Val Gly Met 
325 

Asn Asp Val Arg 
Gly Gly 



300 

Phe Ala Gly Glu 
315 

Asp Tyr Val Asn 
330 

Tyr Arg Phe Val 
345 



His Gly lie Thr 
320 

Thr Ala lie Gin 
335 

Val Asp Val Ala 
350 



<210> 13 
<211> 258 
<212> PRT 
<213> Zea mays 



<400> 13 

Met Ala Thr Thr Ala Thr Glu Ala Ala Pro Ala Gin Glu Gin Gin Ala 

15 10 15 

Asn Gly Asn Gly Glu Gin Lys Thr Arg His Ser Glu Val Gly His Lys 

20 25 30 

Ser Leu Leu Lys Ser Asp Asp Leu Tyr Gin Tyr lie Leu Asp Thr Ser 

35 40 45 

Val Tyr Pro Arg Glu Pro Glu Ser Met Lys Glu Leu Arg Glu lie Thr 

50 55 60 

Ala Lys His Pro Trp Asn Leu Met Thr Thr Ser Ala Asp Glu Gly Gin 
65 A 70 75 80 

Phe Leu Asn Met Leu lie Lys Leu lie Gly Ala Lys Lys Thr Met Glu 

85 90 95 

lie Gly Val Tyr Thr Gly Tyr Ser Leu Leu Ala Thr Ala Leu Ala Leu 

100 * 105 110 

Pro Glu Asp Gly Thr lie Leu Ala Met Asp lie Asn Arg Glu Asn Tyr 

115 ~ 120 125 

Glu Leu Gly Leu Pro Cys lie Glu Lys Ala Gly Val Ala His Lys lie 

130 135 140 

Asp Phe Arg Glu Gly Pro Ala Leu Pro Val Leu Asp Asp Leu lie Ala 
145 150 155 160 

Glu Glu Lys Asn His Gly Ser Phe Asp Phe Val Phe Val Asp Ala Asp 

165 170 175 

Lys Asp Asn Tyr Leu Asn Tyr His Glu Arg Leu Leu Lys Leu Val Lys 

180 185 190 

Leu Gly Gly Leu lie Gly Tyr Asp Asn Thr Leu Trp Asn Gly Ser Val 

195 200 205 

Val Leu Pro Asp Asp Ala Pro Met Arg Lys Tyr He Arg Phe Tyr Arg 

210 215 220 

Asp Phe Val Leu Val Leu Asn Lys Ala Leu Ala Ala Asp Asp Arg Val 
225 230 235 240 

Glu He Cys Gin Leu Pro Val Gly Asp Gly Val Thr Leu Cys Arg Arg 
245 250 255 

Val Lys 
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<210> 14 
<211> 248 
<212> PRT 
<213> Zea mays 

<400> 14 

Met Ala Ser Ala Gly Ala Gly Glu Gly Lys Glu Thr Ala Ala Gly Ser 

1 5 10 15 

Ser Leu His Ser Lys Thr Leu Leu Lys Ser Gin Pro Leu Tyr Gin Tyr 

20 25 30 

lie Leu Glu Ser Thr Val Phe Pro Arg Glu Pro Asp Cys Leu Arg Glu 

35 40 45 

Leu Arg Val Ala Thr Ala Thr His Pro Met Ala Gly Met Ala Ala Ser 

50 55 60 

Pro Asp Glu Val Gin Leu Leu Gin Leu Leu lie Glu lie Leu Gly Ala 
65 70 75 80 

Lys Asn Ala lie Glu Val Gly Val Phe Thr Gly Tyr Ser Leu Leu Ala 

85 90 95 

Thr Ala Leu Ala Leu Pro Asp Asp Gly Lys He Val Ala He Asp Val 

100 105 110 

Thr Arg Glu Ser Tyr Asp Gin lie Gly Ser Pro Val lie Glu Lys Ala 

115 120 125 

Gly Val Ala His Lys He Asp Phe Arg Val Gly Leu Ala Leu Pro Val 

130 135 140 

Leu Asp Gin Met Val Ala Glu Glu Gly Asn Lys Gly Lys Phe Asp Phe 
145 150 155 160 

Ala Phe Val Asp Ala Asp Lys Val Asn Phe Leu Asn Tyr His Glu Arg 

165 170 175 

Leu Leu Gin Leu Leu Arg Val Gly Gly Leu He Ala Tyr Asp Asn Thr 

180 185 190 

Leu Trp Gly Gly Ser Val Ala Ala Ser Pro Asp Glu Pro Leu Ser Glu 

195 200 205 

Arg Asp Arg Ala Leu Ala Ala Ala Thr Arg Glu Phe Asn Ala Ala Val 

210 215 220 

Ala Ala Asp Pro Arg Val His Val Cys Gin Val Ala He Ala Asp Gly 
225 230 235 240 

Leu Thr Leu Cys Arg Arg Val Ala 
245 

<210> 15 
<211> 248 
<212> PRT 
<213> Zea mays 

<400> 15 

Met Ala Ala Gly Gly Asp Asp Thr Thr He Ala Gin Val His Ser Gly 

15 10 15 

He Asp Ser Ser Asn Lys Thr Leu Leu Lys Ser Glu Ala Leu Tyr Lys 

20 25 30 

Tyr Val Leu Asp Thr Ser Val Leu Pro His Glu Pro Glu Ser Met Arg 
35 40 45 
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Glu 


Leu 
50 


Arg 


Leu 


Val 


Thr 


Asp 
55 


Lys 


His 


Glu 


Trp 


Gly 
60 


Phe 


Met 


Gin 


Ser 


Ser 


Pro 


Asp 


Glu 


Ala 


Ser 


Leu 


Leu 


Arg 


Met 


Leu 


He 


Lys 


Leu 


Ser 


Gly 


65 










70 










75 










80 


Ala 


Arg 


Arg 


Thr 


Leu 
85 


Glu 


Val 


Gly 


Val 


Phe 
90 


Thr 


Gly 


Tyr 


Ser 


Leu 
95 


Leu 


Ala 


Thr 


Ala 


Leu 
100 


Ala 


Leu 


Pro 


Ala 


Asp 
105 


Gly 


Lys 


Val 


He 


Ala 
110 


Phe 


Asp 


Val 


Ser 


Arg 


Glu 


Tyr 


Tyr Asp 


He 


Gly 


Arg 


Pro 


Phe 


He 


Glu 


Arg 


Ala 






115 










120 










125 








Gly Val 


Ala 


Gly 


Lys 


Val 


Asp 


Phe 


Arg 


Glu 


Gly 


Pro 


Ala 


Leu 


Glu 


Gin 




130 










135 










140 










Leu 


Asp 


Glu 


Leu 


Leu 


Ala 


Asp 


Pro 


Ala 


Asn 


His 


Gly 


Ala 


Phe 


Asp 


Phe 


145 










150 










155 








160 


Ala 


Phe 


val 


Asp 


Ala 
165 


Asp 


Lys 


Pro 


Asn 


Tyr 
170 


Val 


Arg 


Tyr 


His 


Glu 
175 


Gin 


Leu 


Leu 


Arg 


Leu 
180 


Val 


Arg 


Val 


Gly 


Gly 
185 


Thr 


Val 


Val 


Tyr 


Asp 
190 


Asn 


Thr 


Leu Trp 


Ala 


Gly Thr 


Val 


Ala 


Leu 


Pro 


Pro 


Asp 


Ala 


Pro 


Leu 


Ser 


Asp 






195 










200 










205 






Leu 


Asp 


Arg 


Arg 


Phe 


Ser 


Ala 


Ala 


He 


Arg Glu 


Leu 


Asn 


Val 


Arg 


Leu 




210 










215 










220 










ser 


Gin 


Asp 


Pro Arg 


Val 


Glu 


Val 


Cys 


Gin 


Leu 


Ala 


He 


Ala 


Asp 


Gly 


225 










230 










235 










240 


Val 


Thr 


He 


Cys 


Arg Arg Val 


Val 



















245 



<210> 16 
<211> 371 
<212> PRT 
<213> Zea mays 



<400> 16 



Met 


Thr 


Val 


Val 


Asp Ala Val 


Val 


Ser 


Ser 


Thr 


Asp 


Ala 


Gly 


Ala 


Pro 


1 








5 










10 










15 




Ala 


Ala 


Ala 


Ala 


Thr 


Ala 


Val 


Pro 


Ala 


Gly 


Asn 


Gly 


Gin 


Thr 


Val 


Cys 








20 










25 








30 






Val 


Thr 


Gly 
35 


Ala 


Ala 


Gly 


Tyr 


He 
40 


Ala 


Ser 


Trp 


Leu 


Val 
45 


Lys 


Leu 


Leu 


Leu 


Glu 
50 


Lys 


Gly 


Tyr 


Thr 


Val 
55 


Lys 


Gly 


Thr 


Val 


Arg 
60 


Asn 


Pro 


Asp 


Asp 


Pro 


Lys 


Asn 


Ala 


His 


Leu 


Lys 


Ala 


Leu Asp 


Gly Ala 


Ala 


Glu 


Arg 


Leu 


65 










70 










75 










80 


He 


Leu 


Cys 


Lys 


Ala 
85 


Asp 


Leu 


Leu 


Asp 


Tyr 
90 


Asp 


Ala 


He 


Cys 


Arg 
95 


Ala 


Val 


Gin 


Gly 


Cys 


Gin 


Gly Val 


Phe 


His 


Thr 


Ala 


Ser 


Pro 


Val 


Thr 


Asp 








100 










105 










110 






Asp 


Pro 


Glu 


Gin 


Met 


Val 


Glu 


Pro 


Ala 


Val 


Arg 


Gly 


Thr 


Glu 


Tyr 


Val 






115 










120 






125 






He 


Asn 
130 


Ala 


Ala 


Ala 


Asp 


Ala 
135 


Gly 


Thr 


Val 


Arg 


Arg 
140 


Val 


Val 


Phe 


Thr 



WO 99/10498 



PCT/US98/17519 



19 



Ser 


ser 


He 


Gly Ala 


Val 


Thr 


Met 


Asp 


Pro 


Lys 


Arg 


Gly 


Pro 


Asp 


Val 












150 

J. *mt <J 










155 










160 


Val 


Val 


Asp 




165 


Cys 


Trp 


Ser 


Asp 


Leu 
170 


Glu 


Phe 


Cys 


Glu 


Lys 

175 


Thr 


Arg 


Asn 


Trp 


Tyr 




Tyr 


Gly 


Lys 


Ala 


Val 


Ala 


Glu 


Gin 


Ala 


Ala 


Trp 






180 










185 










190 






Glu 


Thr 


Ala 

X -y ~> 


Arg 


Aiy 


Arg 


Gly 


Val 
200 


Asp 


Leu 


Val 


Val 


Val 
205 


Asn 


Pro 


Val 


T on 


V u J. 


Val 


Ljxy 


Pro 


Leu 


Leu 


Gin 


Ala 


Thr 


Val 


Asn 


Ala 


Ser 


He 


Ala 












215 










220 










nib 


He 


T on 


Lys 


Tyr 


Leu 


Asp 


Glv 


Ser 


Ala 


Arg 


Thr 


Phe 


Ala 


Asn 


Ala 


£ ZD 




















235 










240 


Val 


Gin 


Ala 


Tyr 


vai 
245 


Asp 


Val 


Arg 


Asp 


Val 
250 


Ala 


Asp 


Ala 


His 


Leu 

255 


Arg 


Val 


Phe 


Glu 


Coy 

260 


.fcr XT U 


Arg 


Ala 


Ser 


Gly 
265 


Arg 


Xaa 


Leu 


Cys 


Ala 
270 


GlU 


Arg 


Val 


Leu 


His 
275 


Arg 


Glu 


Asp 


Val 


Val 
280 


Arg 


He 


Leu 


Ala 


Lys 
285 


Leu 


Phe 


Pro 


Glu 


Tyr 
290 


Pro 


Val 


Pro 


Ala 


Arg 
295 


Cys 


Ser 


Asp 


Glu 


Val 
300 


Asn 


Pro 


Arg 


Lys 


Gin 


Pro 


Tyr 


Lys 


Phe 


Ser 


Asn 


Gin 


Lys 


Leu 


Arg 


Asp 


Leu 


Gly 


Leu 


Gin 


305 










310 










315 










320 


Phe 


Arg 


Pro 


Val 


Ser 


Gin 


Ser 


Leu 


Tyr 


Asp 


Thr 


Val 


Lys 


Asn 


Leu 


Gin 








325 










330 










335 




Glu 


Lys 


Gly 


His 
340 


Leu 


Pro 


Val 


Leu 


Gly 
345 


Glu 


Arg 


Thr 


Thr 


Thr 
350 


Glu 


Ala 


Ala 


Asp 


Lys 
355 


Asp 


Ala 


Pro 


Thr 


Ala 
360 


Glu 


Met 


Gin 


Gin 


Gly 
365 


Gly 


He 


Ala 


lie Arg Ala 





























370 



<210> 17 
<211> 177 
<212> PRT 
<213> Zea mays 

<400> 17 

Thr Arg Pro Val Val Gly Leu Asp Arg Asn Val Ser Glu Ser Asp Leu 

15 10 15 

Asp Arg Leu Pro Phe Leu Arg Cys Val He Lys Glu Thr Leu Arg Leu 

20 25 30 

His Pro Pro He Pro Leu Leu Leu His Glu Thr Ala Asp Asp Cys Val 

35 40 45 

Val Ala Gly Tyr Ser Val Pro Arg Gly Ser Arg Val Met Val Asn Val 

50 55 60 

Trp Ala He Gly Arg His Arg Ala Ser Trp Lys Asp Ala Asp Ala Phe 
65 70 75 80 

Arg Pro Ser Arg Phe Ala Ala Pro Glu Gly Glu Ala Ala Gly Leu Asp 

85 90 95 

Phe Lys Gly Gly Cys Phe Glu Phe Leu Pro Phe Gly Ser Gly Arg Arg 
100 105 110 
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Ser Cys Pro Gly Met Ala Leu Gly Leu Tyr Ala Leu Glu Leu Ala Val 

115 120 125 

Ala Gin Leu Ala His Ala Phe Asn Trp Ser Leu Pro Asp Gly Met Lys 

130 135 140 

Pro Ser Glu Met Asp Met Gly Asp lie Phe Gly Leu Thr Ala Pro Arg 
145 150 155 160 

Ala Thr Arg Leu Tyr Ala Val Pro Thr Pro Arg Leu Asn Cys Pro Leu 
165 170 175 

Tyr 



<210> 18 
<211> 235 
<212> PRT 
<213> Zea mays 

<400> 18 

Ala Arg Asp Phe Pro Asp Gly Pro Pro Pro Ser Gly Thr Ala Met Ser 

1 5 10 15 

Val Gly Thr Lys Leu Asn Lys Leu Ser Tyr Asn Ser Val Val Glu lie 

20 25 30 

Val Leu Gin Asn Pro Ala Ala Val Pro Thr Glu Asn His Pro lie His 

35 40 45 

Leu His Gly Phe Asn Phe Phe Val Leu Ala Gin Gly Met Gly Thr Phe 

50 55 60 

Ala Pro Gly Ser Val Ala Tyr Asn Leu Val Asp Pro Val Ala Arg Asn 
65 70 75 80 

Thr lie Ala Val Pro Gly Gly Gly Trp Ala Val lie Arg Phe Val Ala 

85 90 95 

Asn Asn Pro Gly Met Trp Phe Phe His Cys His Leu Asp Pro His Val 

100 105 110 

Pro Met Gly Leu Gly Met Val Phe Gin Val Asp Ser Gly Thr Thr Pro 

115 120 125 

Gly Ser Thr Leu Pro Thr Pro Pro Gly Asp Trp Val Gly Val Cys Asp 

130 135 140 

Ala Gin His Tyr Ala Ala Ala Ala Ala Val Ala Ala Ala Pro Val Pro 
145 150 155 160 

Val Pro Ala Pro Ala Pro Val Pro Ala Pro lie Leu Ala Pro Ala Pro 

165 170 175 

Ala Glu Ser Pro Leu Pro Pro Pro Arg Ala Val Asp His Lys Pro Ser 

180 185 190 

Pro Asn Leu Pro Gin Arg Arg Glu His Thr Gly Thr Ser Asn Ser Ala 

195 200 205 

Ala Gly Arg Arg Ala Lys Gly His Leu Ala Cys Phe Leu Cys Ser Val 

210 215 220 

Leu Leu Phe Phe Leu Leu Arg Gin His Lys Ala 
225 230 235 



<210> 19 
<211> 1924 
<212> DNA 
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<213> Zea mays 

<220> 
<221> CDS 

<222> (16) . . . (1692) 
<400> 19 

cagcagcagc tgagg atg ggc gac gcg gcc ate gec gec gtg cat ttg cat 51 
Met Gly Asp Ala Ala lie Ala Ala Val His Leu His 
15 10 

gag tct gag gag gag cac ate ttc egg age egg ttc ccg ccc gtg gcc 99 
Glu Ser Glu Glu Glu His lie Phe Arg Ser Arg Phe Pro Pro Val Ala 
15 20 25 

gta cca gac gac gtc ace gtg ccg gag ttc gtg ctg gcg gac gcc gag 147 
Val Pro Asp Asp Val Thr Val Pro Glu Phe Val Leu Ala Asp Ala Glu 
30 35 40 

gcc tac gcg gac aag acg gcg etc gtg gag gcc gcg ccg ggt ggc egg 195 
Ala Tyr Ala Asp Lys Thr Ala Leu Val Glu Ala Ala Pro Gly Gly Arg 
45 50 55 60 

tec tac acc tac ggc gag ctg gtc egg gac gtg gcg egg ttc gcc agg 24 3 

Ser Tyr Thr Tyr Gly Glu Leu Val Arg Asp Val Ala Arg Phe Ala Arg 
65 70 75 

gcg ctg egg tec ate ggc gtc cgc agg ggc cac gtc gtg gtg gtc gcg 291 
Ala Leu Arg Ser lie Gly Val Arg Arg Gly His Val Val Val Val Ala 
80 85 90 

etc ccg aac ctg gcg gtg tac ccc gtg gtg age etc ggg ate atg tec 3 39 

Leu Pro Asn Leu Ala Val Tyr Pro Val Val Ser Leu Gly lie Met Ser 
95 100 105 

gcc gga gcg gtc ttc tec ggc gtg aac ccg cgc gcc gtc gcc gcc gag 387 
Ala Gly Ala Val Phe Ser Gly Val Asn Pro Arg Ala Val Ala Ala Glu 
110 115 120 

ate aag aag cag gtg gag gac tec gag gcc agg etc gtg gtc gcc gac 4 35 

lie Lys Lys Gin Val Glu Asp Ser Glu Ala Arg Leu Val Val Ala Asp 
125 130 135 140 

gcg gtg gcc tac gac aag gtg aag gac get ggc gtg ccg gtg ate ggc 48 3 

Ala Val Ala Tyr Asp Lys Val Lys Asp Ala Gly Val Pro Val He Gly 
145 150 155 

ate ggg gac gtg gcg egg ctt ccc ggc gcc ata ggc tgg gac gag etc 531 
He Gly Asp Val Ala Arg Leu Pro Gly Ala lie Gly Trp Asp Glu Leu 
160 165 170 
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etc gec atg gcg gac cgc gcg ggc gcg ccg gtg gtg gcg ctt gag ccg 579 

Leu Ala Met Ala Asp Arg Ala Gly Ala Pro Val Val Ala Leu Glu Pro 
175 180 185 

gcg cag cag tec gac ctg tgc gcg etc ccc tac teg tct ggt acg acg 627 
Ala Gin Gin Ser Asp Leu Cys Ala Leu Pro Tyr Ser Ser Gly Thr Thr 
190 195 200 

qgg gtg tec aag ggc gtg atg ctg age cac egg aac ctg gtg tec age 675 
Glv Val Ser Lys Gly Val Met Leu Ser His Arg Asn Leu Val Ser Ser 
205 210 215 220 

etc tgc tec tec atg ttc gec gtc ggg cag gag ctg gtc ggg cag gtg 723 
Leu Cys Ser Ser Met Phe Ala Val Gly Gin Glu Leu Val Gly Gin Val 
225 230 235 

gtc ace ctg ggc ctg atg ccc ttc ttc cac ate tac ggc ate acc ggc 
Val Thr Leu Gly Leu Met Pro Phe Phe His lie Tyr Gly lie Thr Gly 
240 245 250 



771 



ate tgc tgc gee acg ctg egg cac aag ggc acg gtg gtg gtg atg gac 819 
lie Cys cys Ala Thr Leu Arg His Lys Gly Thr Val Val Val Met Asp 
255 260 265 

cgc ttc gac ctg cgc gcg ttc ctg ggc gcg ctg ctg acg cac cgc gtc 867 
Arg Phe Asp Leu Arg Ala Phe Leu Gly Ala Leu Leu Thr His Arg Val 
270 275 280 

atg ttc gcg ccc gtc gtg ccg ccg gtc atg ctg gee atg gtg aag age 915 
Met Phe Ala Pro Val Val Pro Pro Val Met Leu Ala Met Val Lys Ser 
285 290 295 300 

ccc gtg gee gac gag ttc gac ctg tec ggc ctg gee etc agg tec gtc 96 3 

Pro val Ala Asp Glu Phe Asp Leu Ser Gly Leu Ala Leu Arg Ser Val 
305 310 315 

atg acg gee gee gcg ccg etc gcg ccg gac etc ctg gcg gcg ttc gag 1011 
Met Thr Ala Ala Ala Pro Leu Ala Pro Asp Leu Leu Ala Ala Phe Glu 
320 325 330 

cgc aag ttc ccg ggc gtg cag gtg gag gag gcg tac ggg etc acg gag 1059 
Arg Lys Phe Pro Gly Val Gin Val Glu Glu Ala Tyr Gly Leu Thr Glu 
335 340 345 

cac age tgc ate acg ctg acg cac gee age ggc ggc ggc gag gac gtg 1107 
His ser cys He Thr Leu Thr His Ala Ser Gly Gly Gly Glu Asp Val 
350 355 360 

ggg teg gcg gtg cag gtc gee aag aag aag teg gtc ggc ttc ate ctg 1155 
Glv Ser Ala Val Gin Val Ala Lys Lys Lys Ser Val Gly Phe He Leu 
365 370 375 380 
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ccc aac ctg gag gtg aag ttc gtg gac ccc gac acg ggg egg teg ctg 1203 

Pro Asn Leu Glu Val Lys Phe Val Asp Pro Asp Thr Gly Arg Ser Leu 
385 390 395 

ccc aag aac acg ccg ggg gag ate tgc gtg egg age cag gee gtg atg 1251 
Pro Lys Asn Thr Pro Gly Glu lie Cys Val Arg Ser Gin Ala Val Met 
400 405 410 

cag ggc tac tac agg aag aag gag gag acg gag cgc acc ate gac gee 1299 
Gin Gly Tyr Tyr Arg Lys Lys Glu Glu Thr Glu Arg Thr lie Asp Ala 
415 420 425 

gcg ggg tgg etc cac acg ggc gac gtc ggg tac ate gac gac gac ggc 134 7 

Ala Gly Trp Leu His Thr Gly Asp Val Gly Tyr lie Asp Asp Asp Gly 
430 435 440 

gac gtg ttc ate gtg gac cgc ate aag gag etc ate aag tac aag ggc 1395 
Asp Val Phe lie Val Asp Arg lie Lys Glu Leu lie Lys Tyr Lys Gly 
445 450 455 460 

ttc caa gtc gee cct gee gag ctg gag gee ate ctg ctg tct cac ccg 1443 
Phe Gin Val Ala Pro Ala Glu Leu Glu Ala lie Leu Leu Ser His Pro 
465 470 475 

tec gtc gag gac gee gee gtc ttc ggg ctg ccg gac gag gag gee ggc 1491 
Ser Val Glu Asp Ala Ala Val Phe Gly Leu Pro Asp Glu Glu Ala Gly 
480 485 490 

gag gtc ccg gcg teg tgc gtg gtg egg cga cgt ggc gcg ccg gag age 1539 
Glu Val Pro Ala Ser Cys Val Val Arg Arg Arg Gly Ala Pro Glu Ser 
495 500 505 

gag gcg gac atg atg gcg tac gtg gcg ggg cgc gtt gcg teg tac aag 1587 
Glu Ala Asp Met Met Ala Tyr Val Ala Gly Arg Val Ala Ser Tyr Lys 
510 515 520 

aag etc egg ctg ctg cgc ttc gtg gac gee ate ccc aag teg gtg tec 1635 
Lys Leu Arg Leu Leu Arg Phe Val Asp Ala lie Pro Lys Ser Val Ser 
525 530 535 540 

ggc aag ate ctg egg agg cag etc agg gac gag ttc gtc aag aag acg 1683 
Gly Lys lie Leu Arg Arg Gin Leu Arg Asp Glu Phe Val Lys Lys Thr 
545 550 555 

gca gca gcg taataatgea catcatcctg tgggtgggtg ettgettata 1732 
Ala Ala Ala 



ccagtgcaag atectgeatt cgccacttga tgaagacaat aatacaatta gggtacagtc 
agatgttcca agctactgat acaattgttg tttctgcaaa cagtactcca aactagtgea 



1792 
1852 
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tatacattgg cgttgtggac ccaaaaaaaa ctgctgttcc tgaataactt gagcttcccc 1912 
cagttccctc cc 1924 

<210> 20 
<211> 2122 
<212> DNA 
<213> Zea mays 

<220> 
<221> CDS 

<222> (156) ... (1820) 
<400> 20 

gtcgacccac gcgtccggcc ccggcgaggt cgcaccgcac cagccatcgt ctccttcctt 60 
ccttcctata ctaccatcca accagctgcc cagcgcaacg ttacctgccc gacatccgac 120 
aagccagtcc atccggcagc gagcaaaggt ctgag atg ggt tec gta gac gcg 173 

Met Gly ser Val Asp Ala 
1 5 

gcg ate gcg gtg ccg gtg ccg gcg gcg gag gag aag gcg gtg gag gag 221 
Ala lie Ala Val Pro Val Pro Ala Ala Glu Glu Lys Ala Val Glu Glu 
10 15 20 

aag gcg atg gtg ttc egg tec aag ctt ccc gac ate gag ate gac age 269 
Lys Ala Met Val Phe Arg Ser Lys Leu Pro Asp lie Glu lie Asp Ser 
25 30 35 

age atg gcg ctg cac acc tac'tgc ttc ggg aag atg ggc gag gtg gcg 317 
Ser Met Ala Leu His Thr Tyr Cys Phe Gly Lys Met Gly Glu Val Ala 
40 45 50 

gag egg gcg tgc ctg ate gac ggg ctg acg ggc gcg teg tac acg tac 365 
Glu Arg Ala Cys Leu lie Asp Gly Leu Thr Gly Ala Ser Tyr Thr Tyr 
55 60 65 70 

gcg gag gtg gag tec ctg tec egg cgc gee gcg teg ggg ctg cgc gee 413 
Ala Glu Val Glu Ser Leu Ser Arg Arg Ala Ala Ser Gly Leu Arg Ala 
75 80 85 

atg ggg gtg ggc aag ggc gac gtg gtg atg age ctg etc cgc aac tgc 461 
Met Gly Val Gly Lys Gly Asp Val Val Met Ser Leu Leu Arg Asn Cys 
90 95 100 

ccc gag ttc gec ttc acc ttc ctg ggc gee gee cgc ctg ggc gee gee 509 
Pro Glu Phe Ala Phe Thr Phe Leu Gly Ala Ala Arg Leu Gly Ala Ala 
105 no us 

acc acc acg gee aac ccg ttc tac acc ccg cac gag gtg cac cgc cag 557 
Thr Thr Thr Ala Asn Pro Phe Tyr Thr Pro His Glu Val His Arg Gin 
120 125 130 
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gcg gag gcg gcc ggc gcc agg etc ate gtg acc gag gec tgc gec gtg 605 

Ala Glu Ala Ala Gly Ala Arg Leu He Val Thr Glu Ala Cys Ala Val 
135 140 145 15Q 

gag aag gtg egg gag ttc gcg gcg gag egg ggc ate ccc gtg gtc acc 653 
Glu Lys Val Arg Glu Phe Ala Ala Glu Arg Gly He Pro Val Val Thr 
155 160 i 6 5 

gtc gac ggg cgc ttc gac ggc tgc gtg gag ttc gcc gag ctg ate acq 701 
Val Asp Gly Arg Phe Asp Gly Cys Val Glu Phe Ala Glu Leu He Ala 
170 175 i 80 

gcc gag gag ctg gag gcc gac gcc gac ate cac ccc gac gac gtc ate 749 
Ala Glu Glu Leu Glu Ala Asp Ala Asp He His Pro Asp Asp Val Val 
185 190 195 

gcg ctg ccc tac tec tec ggc acc acc ggg ctg ccc aag ggc ate ata 797 
Ala Leu Pro Tyr Ser Ser Gly Thr Thr Gly Leu Pro Lys Gly Val Met 
200 205 210 

etc acc cac cgc age etc ate acc age gtc gcg cag cag gtt gat ggc 845 
Leu Thr His Arg Ser Leu He Thr Ser Val Ala Gin Gin Val Asp Gly 
215 220 225 2 30 

gag aac ccg aac ctg tac ttc cgc aag gac gac gtg gtg ctg tgc ctg 893 
Glu Asn Pro Asn Leu Tyr Phe Arg Lys Asp Asp Val Val Leu Cys Leu 
235 240 245 

ctg ccg ctg ttc cac ate tac teg ctg aac teg gtg ctg ctg qcc aac 94 i 

Leu Pro Leu Phe His He Tyr Ser Leu Asn Ser Val Leu Leu Ala Gly 
250 255 260 

ctg cgc gcg ggc tec acc ate gtg ate atg cgc aag ttc gac ctg ggc 989 
Leu Arg Ala Gly Ser Thr He Val He Met Arg Lys Phe Asp Leu Gly 
265 270 275 

gcg ctg gtg gac ctg gtg cgc agg tac gtg ate acc ate gcg ccc ttc 1037 
Ala Leu Val Asp Leu Val Arg Arg Tyr Val He Thr He Ala Pro Phe 
280 285 290 

gtg ccg ccc ate gtg gtg gag ate gcc aag age ccc cgc gtg acc gcc 1085 
Val Pro Pro He Val Val Glu He Ala Lys ser Pro Arg Val Thr Ala 
295 300 305 310 

ggc gac etc gcg tec ate cgc atg gtc atg tec ggc gcc gcg ccc atg 1133 
Gly Asp Leu Ala Ser He Arg Met Val Met Ser Gly Ala Ala Pro Met 
315 320 325 

ggc aag gag etc cag gac gcc ttc atg gcc aag att ccc aat gcc gtg H81 
Gly Lys Glu Leu Gin Asp Ala Phe Met Ala Lys He Pro Asn Ala Val 
330 335 340 
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etc ggg cag ggg tac ggg atg acg gag gca ggc ccc gtg ctg gcg atg 1229 
Leu Gly Gin Gly Tyr Gly Met Thr Glu Ala Gly Pro Val Leu Ala Met 
345 350 355 

tac ctg gec ttc gec aag gag ccg tac ccg gtc aag tec ggg teg tgc 1277 
Cys Leu Ala Phe Ala Lys Glu Pro Tyr Pro Val Lys Ser Gly ser Cys 
360 365 370 

aqc acc gtg gtg egg aac gcg gag ctg aag ate gtc gac ccc gac acc 1325 
Glv Thr Val Val Arg Asn Ala Glu Leu Lys He Val Asp Pro Asp Thr 
375 380 385 390 

aac acc gee etc ggc egg aac cag ccc ggc gag ate tgc ate cgc ggg 1373 
Glv Ala Ala Leu Gly Arg Asn Gin Pro Gly Glu He Cys He Arg Gly 
395 400 405 

gag cag ate atg aaa ggt tac ctg aac gac ccc gag teg acg aag aac 14 21 

Glu Gin He Met Lys Gly Tyr Leu Asn Asp Pro Glu Ser Thr Lys Asn 
410 415 420 

acc ate gac aag gac ggc tgg ctg cac acc gga gac ate ggc tac gtg 1469 
Thr He Asp Lys Asp Gly Trp Leu His Thr Gly Asp He Gly Tyr Val 
425 430 435 

aac qac gac gac gag ate ttc ate gtc gac agg etc aag gag ate ate 1517 
Asp Asp Asp Asp Glu He Phe He Val Asp Arg Leu Lys Glu He He 
440 445 450 

aaa tac aag ggc ttc cag gtg ccg ccg gcg gag ctg gag gcg etc etc 1565 
Lys Tyr Lys Gly Phe Gin Val Pro Pro Ala Glu Leu Glu Ala Leu Leu 
455 ~ 460 465 470 

ate acg cac ccg gag ate aag gac gee gee gtc gtc tea atg aac gat 1613 
He Thr His Pro Glu He Lys Asp Ala Ala Val Val Ser Met Asn Asp 
475 480 485 

gac ctt get ggt gaa ate ccg gtc gee ttc ate gtg egg acc gaa ggt 1661 
Asp Leu Ala Gly Glu He Pro Val Ala Phe He Val Arg Thr Glu Gly 
490 495 500 

tct caa gtc acc gag gat gag ate aag caa ttc gtc gee aag gag gtg 1709 
Ser Gin Val Thr Glu Asp Glu He Lys Gin Phe Val Ala Lys Glu Val 
505 510 515 

qtt ttc tac aag aag ate cac aag gtc ttc ttc acc gaa tec ate ccc 1757 
Val Phe Tyr Lys Lys He His Lys Val Phe Phe Thr Glu Ser He Pro 
520 525 530 

aag aac ccg teg ggc aag ate ctg agg aag gac ttg aga gee agg etc 1805 
Lys Asn Pro Ser Gly Lys He Leu Arg Lys Asp Leu Arg Ala Arg Leu 
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535 540 545 550 

gcc gcc ggt gtt cag tgaggcccgt atgcagcttc ttctcagacg cccaccacac 1860 
Ala Ala Gly Val Gin 
555 

tgctgcgaaa aaaaaaggcg atgtaatggc ggtaacacta ctattcatga acaggcaaca 1920 

gaagggacac gtattcctgt ggaacacatg ttgccagaaa gaggttttag tttgtctgtt 1980 

tgttggccct gtgtgtaatg ttcataaacg atatagacgt ctctctatgt tgtttttgac 2040 

ccaaagaaca agcctgatta cggcaataca gctatggagt accgtgtttc ataaaaaaaa 2100 

aaaaaaaaaa aagggcggcc gc 2122 

<210> 21 
<211> 1970 
<212> DNA 
<213> Zea mays 

<220> 
<221> CDS 

<222> (43) • . . (1767) 
<400> 21 

ccacgcgtcc gcactggtcc ggattcgcag agcaaaacgg aa atg ate acg gtg 54 

Met lie Thr Val 

1 

gca gca ccg gag gcg cag ccg cag gtg gcg gcg gcg gcg gcg gtt gcg 102 
Ala Ala Pro Glu Ala Gin Pro Gin Val Ala Ala Ala Ala Ala Val Ala 
5 10 15 20 

gcg ccg gag gag acc gtg ttc egg teg aag ctg ccg gac ate gac ate 150 
Ala Pro Glu Glu Thr Val Phe Arg Ser Lys Leu Pro Asp He Asp He 
25 30 35 

ccc acc cac ctg ccg etc cac gac tac tgc ttc teg agg gcg gcg gag 198 
Pro Thr His Leu Pro Leu His Asp Tyr Cys Phe Ser Arg Ala Ala Glu 
40 45 50 

gcg gcg ggc gcg ccg tgc etc ate gcg gcg gcc acg ggg agg acc tac 246 
Ala Ala Gly Ala Pro Cys Leu lie Ala Ala Ala Thr Gly Arg Thr Tyr 
55 60 65 

acg tac gcc gag acg cgc etc ctg tgc cgc aag gcc gcg gcg tgc etc 294 
Thr Tyr Ala Glu Thr Arg Leu Leu Cys Arg Lys Ala Ala Ala Cys Leu 
70 75 80 

cac ggg etc ggc gtc gcc cag ggc gac cgc gtc atg etc ctg etc cag 342 
His Gly Leu Gly Val Ala Gin Gly Asp Arg Val Met Leu Leu Leu Gin 
85 90 95 100 

aac tec gtc gag ttc gtg etc gcc ttc ttc ggc gcg tec ttc etc ggc 390 
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Asn Ser Val Glu Phe Val Leu Ala Phe Phe Gly Ala Ser Phe Leu Gly 

105 110 115 

gcc gtc acc acg gcc gcc aac cca ttc tgc acg ccg cag gag ate cac 4 38 

Ala Val Thr Thr Ala Ala Asn Pro Phe Cys Thr Pro Gin Glu lie His 
120 125 130 

aag cag ttc age gcc tec ggc gcg aag gtc gtc gtc acc cac tec gcc 486 
Lys Gin Phe Ser Ala Ser Gly Ala Lys Val Val Val Thr His Ser Ala 
135 140 145 

tac gtc gcc aag etc egg cac ggc gcc ttc ccg agg ate ggc acg gtg 534 
Tyr Val Ala Lys Leu Arg His Gly Ala Phe Pro Arg lie Gly Thr Val 
150 155 160 

age ggc ggc ggc gtg gac ggc aat gcc etc etc acc gtc etc acc ate 582 
Ser Gly Gly Gly Val Asp Gly Asn Ala Leu Leu Thr Val Leu Thr lie 

165 170 175 180 

gac ggc gac gcg gcc gac acc ccg gaa ggc tgc ctg gcg ttc tgg gag 630 
Asp Gly Asp Ala Ala Asp Thr Pro Glu Gly Cys Leu Ala Phe Trp Glu 
185 190 195 

ctg etc acg tec ggc gac ggc gac gcc etc ccg gag gtg tec ate tec 678 
Leu Leu Thr Ser Gly Asp Gly Asp Ala Leu Pro Glu Val Ser lie Ser 
200 205 210 

ccg gac gac ccc gtg gcg ctg ccg ttc teg teg ggc acc acg ggg ctg 726 
Pro Asp Asp Pro Val Ala Leu Pro Phe Ser Ser Gly Thr Thr Gly Leu 
215 220 225 

ccc aag ggc gtc gtg ctg acc cac ggc ggc cag gtc acg aac gtg gcg 774 
Pro Lys Gly Val Val Leu Thr His Gly Gly Gin Val Thr Asn Val Ala 
230 235 240 

cag cag gtg gac ggc gcg aac ccc aac ctg tac atg egg gag ggc gac 82 2 

Gin Gin Val Asp Gly Ala Asn Pro Asn Leu Tyr Met Arg Glu Gly Asp 
245 250 255 260 

gtc gcg etc tgc gtg ctg cct ctg ttc cac ate ttc tec etc aac tec 870 
Val Ala Leu Cys Val Leu Pro Leu Phe His lie Phe Ser Leu Asn Ser 
265 270 275 

gtg ctg etc tgc gcc atg egg gcc ggc gcg gcg gtc atg etc atg ccc 918 
Val Leu Leu Cys Ala Met Arg Ala Gly Ala Ala Val Met Leu Met Pro 
280 285 290 

aag ttc gag atg ggc gcc atg ctg gag ggc ate cag egg tgg cgc gtc 966 
Lys Phe Glu Met Gly Ala Met Leu Glu Gly lie Gin Arg Trp Arg Val 
295 300 305 
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aca gcg gca gcc g-cc gug ccg ccg ctg gtg etc gec ctg gec aag aac 
Thr Val Ala Ala Val Val Pro Pro Leu Val Leu Ala Leu Ala Lys Asn 
310 315 320 

ccc gcg etc gag aag tac gac etc age tec ate egg ate gtg etc tec 
Pro Ala Leu Glu Lys Tyr Asp Leu Ser Ser lie Arg lie Val Leu Ser 
325 330 335 340 

ggc gcc gcg ccg ctt ggc aag gac etc gtc gac gca etc cgc gcc cgc 
Gly Ala Ala Pro Leu Gly Lys Asp Leu Val Asp Ala Leu Arg Ala Arg 
345 350 355 

gtg cca cag gcc gtc ttc gga cag gga tac ggg atg acg gag gcc ggg 
Val Pro Gin Ala Val Phe Gly Gin Gly Tyr Gly Met Thr Glu Ala Gly 
360 365 370 

ccc gtg ctg tec atg tgc ccg gcg ttc gcc aag gag ccg gcg ccc gcc 
Pro Val Leu Ser Met Cys Pro Ala Phe Ala Lys Glu Pro Ala Pro Ala 



aag ccg ggg teg tgc ggc aca gtg gtg cgc aac gcg gag etc aag gtg 
Lys Pro Gly Ser Cys Gly Thr Val Val Arg Asn Ala Glu Leu Lys Val 
390 395 400 

gtg gac ccg gac acg ggc etc tec etc ggc cgc aac etc ccc ggc gag 
Val Asp Pro Asp Thr Gly Leu Ser Leu Gly Arg Asn Leu Pro Gly Glu 
405 * 410 415 420 

ate tgc ate egg ggc ccg cag ate atg aaa ggg tac ctg aac gac ccg 
lie Cys lie Arg Gly Pro Gin lie Met Lys Gly Tyr Leu Asn Asp Pro 
425 430 435 

gag gcc ace gcg agg acg ate gac gtc cac ggc tgg etc cac acc ggc 
Glu Ala Thr Ala Arg Thr lie Asp Val His Gly Trp Leu His Thr Gly 
440 445 450 

gac ate ggc tac gtc gac gac gac gac gag gtc ttc ate gtc gac cgc 
Asp lie Gly Tyr Val Asp Asp Asp Asp Glu Val Phe lie Val Asp Arg 
455 460 465 

gtc aag gag etc ate aag ttc aag ggc ttc cag gtg ccg ccg gcc gag 
Val Lys Glu Leu He Lys Phe Lys Gly Phe Gin Val Pro Pro Ala Glu 
470 475 480 

etc gag get ctg etc gtc gcc cac ccg tec ate gcc gac gcg gcc gtc 
Leu Glu Ala Leu Leu Val Ala His Pro Ser lie Ala Asp Ala Ala Val 
485 490 495 500 

gtc ccg caa aag gac gaa gcc gcc ggc gag gtc ccc gtc gcc ttc gtg 
Val Pro Gin Lys Asp Glu Ala Ala Gly Glu Val Pro Val Ala Phe Val 



1014 
1062 
1110 
1158 
1206 
1254 
1302 
1350 
1398 
1446 
1494 
1542 
1590 



505 



510 



515 
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gtc cgc gcc gcc gac gcc gac ate gcg gag gac gec ate aag gag ttc 16 3 8 

Val Arg Ala Ala Asp Ala Asp lie Ala Glu Asp Ala lie Lys Glu Phe 
520 525 530 

ate tec aag cag gtg gta tta tac aag agg at a cac aag gtg tac ttc 1686 
lie ser Lys Gin Val Val Leu Tyr Lys Arg lie His Lys Val Tyr Phe 
535 540 545 

ace ccc tec ate ccc aag teg gcg tec ggg aag ate ctg agg agg gag 1734 
Thr Pro Ser lie Pro Lys Ser Ala Ser Gly Lys lie Leu Arg Arg Glu 
550 555 560 

ctg cgc gcc aag etc gcg gca gcc gcg ace gcc tgaggagctt gaegctcagg 1787 
Leu Arg Ala Lys Leu Ala Ala Ala Ala Thr Ala 
565 570 575 

ttccatgcct tgctacgtgc aactcgtttg gtagtggcgt ttactgccaa tccagtgtat 1847 

gggtacaggc gtgaaacatg aggtaccgag aatgtactat caatatatgc tttgtatgcg 1907 

agtgcttcgt caattaagga ccaccattta ttcccgtcat gattttattt ccaaaaaaaa 1967 

aaa 1970 

<210> 22 

<211> 1269 

<212> DNA 

<213> Zea mays 

<220> 
<221> CDS 

<222> (56). . .(1117) 
<400> 22 

actagtggat ccaaagaatt eggcacgagg agagatcagc gacttaacaa caata atg 58 

Met 
1 

gcc acc gcc ata gtt ccc acg gac gcc gag ctg ctg cag gcg cag gcc 106 
Ala Thr Ala lie Val Pro Thr Asp Ala Glu Leu Leu Gin Ala Gin Ala 
5 10 15 

gac ctg tgg cgc cac age etc tac tac ctg aca tec atg gcg etc aag 154 
Asp Leu Trp Arg His Ser Leu Tyr Tyr Leu Thr Ser Met Ala Leu Lys 
20 25 30 

tgc gcg gtg gag etc cac ate ccg acc gcc ate cac aac eta ggc ggg 202 
Cys Ala Val Glu Leu His lie Pro Thr Ala lie His Asn Leu Gly Gly 
35 40 45 

tct gcc acg ctg ccg gac etc gtg gcc gcg ctg tec ctg cca gcg gcc 250 
Ser Ala Thr Leu Pro Asp Leu Val Ala Ala Leu Ser Leu Pro Ala Ala 
50 55 60 65 
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aag etc ccg ttc etc ggg cgc gtg atg egg ctg ctg gtc acg teg ggc 298 

Lys Leu Pro Phe Leu Gly Arg Val Met Arg Leu Leu Val Thr Ser Gly 
70 75 80 

gtc ttc gcg teg tec gac gac gtg cag tac egg ctg aac ccg ctg tec 346 
Val Phe Ala Ser Ser Asp Asp Val Gin Tyr Arg Leu Asn Pro Leu Ser 
85 90 95 

tgg ctg ctg gtg gag ggc gtg gag teg gag gac cac acc tac cag aag 394 
Trp Leu Leu Val Glu Gly Val Glu Ser Glu Asp His Thr Tyr Gin Lys 
100 105 110 

tac ttc gtg ctg ggc acc gtc tec cgc cac tac gtg gag gee ggc atg 442 
Tyr Phe Val Leu Gly Thr Val Ser Arg His Tyr Val Glu Ala Gly Met 
115 120 125 

tec ctg gee gac tgg ttc aag aag gag gag gac gag gac cgc cag ctg 490 
Ser Leu Ala Asp Trp Phe Lys Lys Glu Glu Asp Glu Asp Arg Gin Leu 
130 135 140 " 145 

ccg teg ccg ttc gag gee ctg cac ggg gtg ccc etc gtc cac gag age 538 
Pro Ser Pro Phe Glu Ala Leu His Gly Val Pro Leu Val His Glu Ser 
150 155 160 

acc aag ctg ctg gac gag gag ctg gac agg gtc gtg gag gaa ggc gtg 586 
Thr Lys Leu Leu Asp Glu Glu Leu Asp Arg Val Val Glu Glu Gly Val 
165 170 175 

gee gcg cac gac aac ctg gee ate ggg acc gtc ata egg gag tgc ggc 634 
Ala Ala His Asp Asn Leu Ala lie Gly Thr Val lie Arg Glu Cys Gly 
180 185 190 

gee gac gtc ttc age ggc etc cgc teg etc acc tac tgc tgc ggc agg 68 2 

Ala Asp Val Phe Ser Gly Leu Arg Ser Leu Thr Tyr Cys Cys Gly Arg 
195 200 205 

cag ggg aac gee age gcg gee gee ate gtc aag gee ttc cca gac ate 7 30 

Gin Gly Asn Ala Ser Ala Ala Ala lie Val Lys Ala Phe Pro Asp lie 
210 215 220 225 

aag tgc acc gtg etc aac ctt ccc agg gtc gtc gag gag acg acg acc 778 
Lys Cys Thr Val Leu Asn Leu Pro Arg Val Val Glu Glu Thr Thr Thr 
230 235 240 

aag acc ate acc ate ccg cct gcg cag get gtc atg etc aag etc gtc 826 
Lys Thr lie Thr lie Pro Pro Ala Gin Ala Val Met Leu Lys Leu Val 
245 250 255 

ctg cac ttc tgg age gac gac gac tgc gtc aag ate ctg gag ctg tgc 87 4 

Leu His Phe Trp Ser Asp Asp Asp Cys Val Lys lie Leu Glu Leu Cys 
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260 265 270 

agg aag gcc ate cct tec cgc caa gaa gga ggg aag gtg ate ate att 92 2 

Arg Lys Ala lie Pro Ser Arg Gin Glu Gly Gly Lys Val lie lie lie 
275 280 ^ 285 

gag ata etc ctg ggc ccg tac atg ggg ccg gtc atg tac gag gcc cag 970 
Glu lie Leu Leu Gly Pro Tyr Met Gly Pro Val Met Tyr Glu Ala Gin 
290 295 300 305 

ctg ctg atg gac atg etc atg atg gtg aac ace aag ggc agg cag cgc 1018 
Leu Leu Met Asp Met Leu Met Met Val Asn Thr Lys Gly Arg Gin Arg 
310 315 320 

ggc gaa gac gac tgg cgc cac ate ttt acc aag get ggc ttc tec gac 1066 
Gly Glu Asp Asp Trp Arg His He Phe Thr Lys Ala Gly Phe Ser Asp 
325 330 335 

tac aag gtt gtc aag aaa ate gga get cgt ggt gtc ate gag gtc tac 1114 
Tyr Lys Val Val Lys Lys He Gly Ala Arg Gly Val lie Glu Val Tyr 
340 345 350 



cca tgatccatga tcgatgtcat gtgactgtga gaggacgata ctgtacaatt 1167 
aaataaaegg ggtatctagc tactactcag cttttgtacc tcgagatcca tgcatgttaa 1227 
ttacttgett ccatctgttt teaaaatgea tctatgtaat gt 1269 

<210> 23 
<211> 1412 
<212> DNA 
<213> Zea mays 

<220> 
<221> CDS 

<222> (139) ... (1263) 
<400> 23 

gtcgacccac gcgtccgcca ggttccattc gtctctgcag tctcacccac aagagacaca 60 
aacctagcgc aacaagcaat cgaaaaagag atttggctac aaccaattaa ccattggcca 120 
gcagtgtacg tgggaacg atg gcc etc atg cag gag agt agt age cag gat 171 

Met Ala Leu Met Gin Glu Ser Ser Ser Gin Asp 
1 5 10 

ttg etc caa get cac gac gag etc ttg cac cat tec ctg tgc ttc gcc 219 
Leu Leu Gin Ala His Asp Glu Leu Leu His His Ser Leu Cys Phe Ala 
15 20 25 

aaa teg etc gcg etc gcc gtg gcg ctg gac etc cgc ate ccc gac gcg 267 
Lys Ser Leu Ala Leu Ala Val Ala Leu Asp Leu Arg lie Pro Asp Ala 
30 35 40 

ate cac cac cac ggg gcc ggc ggc gcc acc ctt etc cag ate etc gcc 315 
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He His His His Gly Ala Gly Gly Ala Thr Leu Leu Gin He Leu Ala 
45 50 



55 



gag act gcg etc cac cca age aag ctt cgc gee ctt cac can a-n 
Glu Thr Ala Leu His Pro Ser Lys Leu Arg La Leu A?g Arg Leu Me? 
60 65 70 ?5 



cgc gtg etc acc gtc acg ggc ate ttc age gtc ate aaa <~a« rv~*. ^= 
Arg Val Leu Thr Val Thr Gly He Phe sir Sal &f S2 Gin Pro Pro 
80 85 90 

gca ggt ggt ggt gat gat tea acc gtc cac acg teg gac aac aaa act 
Ala Gly Gly Gly Asp Asp Ser Thr Val His Thr Ser Isp lip £! K 
95 100 105 

gtc gtc gtc tac agg ttg acg gca acc tec cac t-t-o 

Val val Val Tyr Arg Leu Thr La Ala ler Arg P„e Leu 5S Ilr lip 

115 120 

gac gtg age acg gcg acc ttg get ccc ttt gtg agt ctg aca etc caa r« 
Asp Val ser Thr Ala Thr Leu Ala Pro Phe Val sir Leu 111 Su all 5 
J. ZD 13 0 



135 



p™ ?? C o 9t S Cg C ? C gCC Ctg agt atc tcc ^g tgg ttc egg 

Pro lie Ala Ala Cys Pro His Ala Leu Gly He Ser All T?p Phe A?g 

145 150 155 

cag gag cag cac gag ccg tec ccg tat ggc ctg gcg ttc cac caa acc 
Gin Glu Gin His Glu Pro Ser Pro Tyr Gly LeS LI p£e tig Gin Thr 
160 165 * 170 



411 



459 



507 



603 



651 



Pro Thr Se 82 Glu S?« ??* 1™ ^ gt * * aC gCC tta cta aac aaa 699 
Pro Thr lie Trp Glu His Ala Asp Asp Val Asn Ala Leu Leu Asn Lys 

175 180 185 ys 

Glv SE? ??» ?? 9 ? aC ag ° Cg ° ttc ctc ata cca at * gtg ctg agg gag 
Gly Met Ala Ala Asp Ser Arg Phe Leu Met Pro He Val Leu Arg Glu 

190 195 200 

& §?f SS S S S Sf S S S 3 !S E ?S 15 8g 

215 

E E 85 15 E If. E S IS E E E £ E S SS 

225 2 3 0 235 

SI Ss sir val Leu 11° T CtC S Cg °? C gtt gtc * cc * ct tct 891 

Lys cys Ser Val Leu Asp Leu Pro His Val Val Ala Gly Ala Pro Ser 



747 



795 



843 



240 245 * 2 5 0 
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gat ggc aac gtg cag ttc gtc gca ggc aat atg ttt gag agt att cca 939 

Asp Gly Asn Val Gin Phe Val Ala Gly Asn Met Phe Glu Ser lie Pro 

255 260 265 

cct gca acc get gtt ttc etc aag aaa act eta cat gac tgg ggt gac 987 
Pro Ala Thr Ala Val Phe Leu Lys Lys Thr Leu His Asp Trp Gly Asp 
270 275 280 

gat gag tgt gtc aag ata ttg aag aat tgc aag caa gee ata tct cca 1035 
Asp Glu Cys Val Lys lie Leu Lys Asn Cys Lys Gin Ala lie Ser Pro 
285 290 295 

egg gat gca ggt ggg aag gta ata ate ttg gat gtg gta gtt gga tat 1083 
Arg Asp Ala Gly Gly Lys Val lie lie Leu Asp Val Val Val Gly Tyr 
300 305 310 315 

aaa cag tea aac ata aag cat caa gag aca caa gtt atg ttt gat ttg 1131 
Lys Gin Ser Asn lie Lys His Gin Glu Thr Gin Val Met Phe Asp Leu 
320 325 330 

tat atg atg gcg gtt aac gga gtt gag cgt gac gag caa gag tgg aag 1179 
Tyr Met Met Ala Val Asn Gly Val Glu Arg Asp Glu Gin Glu Trp Lys 
335 340 345 

aag ate ttc act gaa get gga ttc aaa gac tac aaa att eta ccc gtc 1227 
Lys lie Phe Thr Glu Ala Gly Phe Lys Asp Tyr Lys lie Leu Pro Val 
350 355 360 

att ggt gat gta teg gtc ate ate gag gtc tat cct tgaatgcttt 1273 
lie Gly Asp Val Ser Val He He Glu Val Tyr Pro 
365 370 375 

gtgaacaaag gcctccataa taaactgaag accaagaggt gttgatagta tattatgaat 13 33 
tgttatttgt cctgtacact tgattctttg cgtatttgta atgacaagtt gagtaaaaaa 1393 
aaaaaaaaag ggcggccgc 1412 

<210> 24 

<211> 1315 

<212> DNA 

<213> Zea mays 

<220> 
<221> CDS 

<222> (51) .(1160) 
<400> 24 

ccacgcgtcc gagecaacta gcagtatata cgttggcacg cacgaatacg atg gca 56 

Met Ala 
1 

etc atg caa gag age agt age cag gac cag gac atg etc caa get cac 104 
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Leu Met Gin Glu Ser Ser Ser Gin Asp Gin Asp Met Leu Gin Ala His 
5 10 15 

gac gag etc ttg cac cat tec ttg tgc ttc gec aaa teg etc gcg etc 152 
Asp Glu Leu Leu His His Ser Leu Cys Phe Ala Lys Ser Leu Ala Leu 
20 25 30 

ace gtg gcg ctg gac etc cgc ate cca gac gee ate cac cac cac ggc 200 
Thr Val Ala Leu Asp Leu Arg lie Pro Asp Ala lie His His His Gly 
35 40 45 50 

ggc ggc gee ace ctt etc cag ate etc gcg gag act ggg etc cac cca 248 
Gly Gly Ala Thr Leu Leu Gin lie Leu Ala Glu Thr Gly Leu His Pro 
55 60 65 

age aag ctt cgc gee eta cgc cgc etc atg cgc gtg etc ace gtc acg 296 
Ser Lys Leu Arg Ala Leu Arg Arg Leu Met Arg Val Leu Thr Val Thr 
70 75 80 

ggc ace ttc age gtc cag gtc cag caa cca cca gee ggt agt gac gac 344 
Gly Thr Phe Ser Val Gin Val Gin Gin Pro Pro Ala Gly Ser Asp Asp 
85 90 95 

gac gaa get gtc gtc gtc tac agg ctg aca gca gee tec cgc ttc etc 39 2 

Asp Glu Ala Val Val Val Tyr Arg Leu Thr Ala Ala Ser Arg Phe Leu 
100 105 110 

gtc age gac gag gtg age acg gca aca ace ttg get ccc ttt gtg age 440 
Val Ser Asp Glu Val Ser Thr Ala Thr Thr Leu Ala Pro Phe Val Ser 
115 120 125 130 

ctg gcg etc cag cct ate get gee tct ccg cac gee eta ggc ate tgc 488 
Leu Ala Leu Gin Pro lie Ala Ala Ser Pro His Ala Leu Gly lie Cys 
135 140 145 

gcg tgg ttt egg cag gag cag cac gag ccg tec ccg tat ggc ctg gca 536 
Ala Trp Phe Arg Gin Glu Gin His Glu Pro Ser Pro Tyr Gly Leu Ala 
150 155 160 

ttc cgc cag acc cca acg etc tgg gaa cat get gac gac gta aac gee • 584 
Phe Arg Gin Thr Pro Thr Leu Trp Glu His Ala Asp Asp Val Asn Ala 
165 170 175 

tta ctg aac aaa ggc atg gtg gcg gac age cgc ttc ctg atg cca att 63 2 

Leu Leu Asn Lys Gly Met Val Ala Asp Ser Arg Phe Leu Met Pro lie 
180 185 190 

gtg etc agg cag tgc ggc gag atg ttt cgt ggg ate aac tea ttg gtt 680 
Val Leu Arg Gin Cys Gly Glu Met Phe Arg Gly lie Asn Ser Leu Val 
195 200 205 210 
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gac gtc ggc ggt ggg cat ggt ggc gcc gcc gcc gcc ate gec get gec 7 28 

Asp Val Gly Gly Gly His Gly Gly Ala Ala Ala Ala lie Ala Ala Ala 
215 220 225 

ttc ccg cac gtc aag tgc age gtg ctt gac etc ccg cac gtt gtc gcc 776 

Phe Pro His Val Lys Cys Ser Val Leu Asp Leu Pro His Val Val Ala 
230 235 240 

ggt get cca tct gat ggc aac gtg cag ttc gtc gca gga aat atg ttt 824 

Gly Ala Pro Ser Asp Gly Asn Val Gin Phe Val Ala Gly Asn Met Phe 
245 250 255 

gag agt att cca cct gca acc get gtt ttc etc aag aaa act eta cat 872 

Glu Ser lie Pro Pro Ala Thr Ala Val Phe Leu Lys Lys Thr Leu His 
260 265 270 

gac tgg ggt gac gat gag tgt gtc aag at a ttg aag aat tgc aag caa 920 

Asp Trp Gly Asp Asp Glu Cys Val Lys He Leu Lys Asn Cys Lys Gin 

275 280 285 290 

gcc at a cct cca egg gat gca ggt gga aag gta at a ate ttg gac gtg 968 

Ala He Pro Pro Arg Asp Ala Gly Gly Lys Val He lie Leu Asp Val 
295 300 305 

gta gtt gga tat aaa cag tea aac ata aag cat caa gag aca caa gtt 1016 

Val Val Gly Tyr Lys Gin Ser Asn He Lys His Gin Glu Thr Gin Val 
310 315 320 

atg ttc gat ttg tat atg atg gcc gtt aac gga gtt gag cgt gac gag 1064 

Met Phe Asp Leu Tyr Met Met Ala Val Asn Gly Val Glu Arg Asp Glu 
325 330 335 

caa gag tgg aag aag ate ttc gcc gaa gcc gga ttc aaa gac tac aaa 1112 

Gin Glu Trp Lys Lys He Phe Ala Glu Ala Gly Phe Lys Asp Tyr Lys 
340 345 350 

att eta ccc gtc att ggt gac gtg teg gtc ate ate gag gtc tat cct 1160 

He Leu Pro Val lie Gly Asp Val Ser Val He He Glu Val Tyr Pro 

355 360 365 370 

tgaatgcttt atttgtgaat aataaagggc gectaattea taataaacct aggattgtga 1220 
aggcgctggt attacattaa gaattgttcc tttttattac catgtgcttg aacctttgga 1280 
caatttgtaa tatgagaagg tgagcaattg tgttt ^ 1315 

<210> 25 
<211> 1306 
<212> DNA 
<213> Zea mays 



<220> 
<221> CDS 
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<222> (57) . . . (1154) 
<400> 25 

ccacgcgtcc ggagaattag ctatagccag ccgtgtgtcg aacacgacac gcgata atg 59 

Met 
1 

gca etc atg cag gag age age cag gac ttg etc gaa gcg cac gac gag 107 
Ala Leu Met Gin Glu Ser Ser Gin Asp Leu Leu Glu Ala His Asp Glu 
5 10 15 

etc ttc cac cac tgc ctg tgc ttc gec aaa teg etc gcg etc gee gtg 155 
Leu Phe His His Cys Leu Cys Phe Ala Lys Ser Leu Ala Leu Ala Val 
20 25 30 

gcg cag gac etc cgc ate ccc gac gcg ate cac cac cac gga ggc ggc 20 3 

Ala Gin Asp Leu Arg lie Pro Asp Ala lie His His His Gly Gly Gly 
35 40 45 

gee acc etc cac cag ate etc gee gag gee gcg etc cac cca age aag 251 
Ala Thr Leu His Gin lie Leu Ala Glu Ala Ala Leu His Pro Ser Lys 
50 55 60 65 

ctt cgc gee eta cgc cgc ctg atg cgc gtg etc acc gtc teg ggc gtc 299 
Leu Arg Ala Leu Arg Arg Leu Met Arg Val Leu Thr Val Ser Gly Val 
70 75 80 

ttc acc gtc cag tat tct tea acc gtc gac gcg teg gac gga get gat 347 
Phe Thr Val Gin Tyr Ser Ser Thr Val Asp Ala Ser Asp Gly Ala Asp 
85 90 95 

gtc gtc tac agg ctg acg gca gec tec cgc ttc etc gtc age gat age 395 
Val Val Tyr Arg Leu Thr Ala Ala Ser Arg Phe Leu Val Ser Asp Ser 
100 105 110 

gac gag gcg ggc acg gcg tec ttg get ccc ttt gcg aac ctg gcg etc 443 
Asp Glu Ala Gly Thr Ala Ser Leu Ala Pro Phe Ala Asn Leu Ala Leu 
115 120 125 

cac cct ate gec ate tec ccg cac gee gtg ggc ate tgc gcg tgg ttc 491 
His Pro lie Ala lie Ser Pro His Ala Val Gly lie Cys Ala Trp Phe 
130 135 140 145 

egg cag gag cag cac gac ccg tec ccg tac ggc ctg gcg ttc cgc cag 53 9 

Arg Gin Glu Gin His Asp Pro Ser Pro Tyr Gly Leu Ala Phe Arg Gin 
150 155 160 

ate ccg acc ate tgg gag cat get gac aac gta aac gee eta ctg aac 587 
lie Pro Thr lie Trp Glu His Ala Asp Asn Val Asn Ala Leu Leu Asn 
165 170 175 
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aaa ggc ttg etc gcg gaa age cgc ttc ttg atg cca ate gta etc agg 63 5 

Lys Gly Leu Leu Ala Glu Ser Arg Phe Leu Met Pro lie Val Leu Arg 
180 185 190 

gag tgc gga gac gag gtg ttc cgt ggg ate gac teg ttg gtc gac gtc 683 
Glu Cys Gly Asp Glu Val Phe Arg Gly lie Asp Ser Leu Val Asp Val 
195 200 205 

ggc ggt ggg cac ggt ggc gee gee gee ace ate gee gec gca ttc ccg 7 31 

Gly Gly Gly His Gly Gly Ala Ala Ala Thr lie Ala Ala Ala Phe Pro 
210 215 220 225 

cac gtc aag tgc age gtg ctt gac etc ccg cac gtt gtc gee ggt get 779 
His Val Lys Cys Ser Val Leu Asp Leu Pro His Val Val Ala Gly Ala 
230 235 240 

cca tec gat gee tgc gtg cag ttc gtt gcg ggc aat atg ttc cac agt 827 
Pro Ser Asp Ala Cys Val Gin Phe Val Ala Gly Asn Met Phe His Ser 
245 250 255 

att cca cct gca acc gee gtt ttc ttc aag aca act eta tgt gac tgg 875 
lie Pro Pro Ala Thr Ala Val Phe Phe Lys Thr Thr Leu Cys Asp Trp 
260 265 270 

ggt gac gac gag tgc ate aag ata ttg aag aat tgc aag caa gee ata 923 
Gly Asp Asp Glu Cys lie Lys lie Leu Lys Asn Cys Lys Gin Ala lie 
275 280 285 

tct cca egg gat gag ggt ggg aag gtg ata ate atg gac gtg gta gtc 971 
Ser Pro Arg Asp Glu Gly Gly Lys Val lie He Met Asp Val Val Val 
290 295 300 305 

ggg tat ggg cag tea aac atg aag cgc eta gag aca caa gtt atg ttt 1019 
Gly Tyr Gly Gin Ser Asn Met Lys Arg Leu Glu Thr Gin Val Met Phe 
310 315 320 

gat ttg gtt atg atg gcg gtc aat gga gtc gag cgc gac gag caa gag 1067 
Asp Leu Val Met Met Ala Val Asn Gly Val Glu Arg Asp Glu Gin Glu 
325 330 335 

tgg aag gag atg ttc att gaa get gga ttc aaa gac tac aaa ate cga 1115 
Trp Lys Glu Met Phe He Glu Ala Gly Phe Lys Asp Tyr Lys He Arg 
340 345 350 

cca gta get ggc etc atg teg gtc ate gag gtc tat cca tgaattcttt 1164 
Pro Val Ala Gly Leu Met Ser Val He Glu Val Tyr Pro 
355 360 365 

gtgaacaaaa ggccggctgc cataatataa actgaagacc aegaegtegt catggagctg 12 24 
agcgtgttgt tttttagact actaggcact tgagcctctg agaatttgta ataataaata 1284 
agctgagcaa cagcgttctg tt 13 06 
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<210> 26 
<211> 1794 
<212> DNA 
<213> Zea mays 

<220> 
<221> CDS 

<222> (80) . • . (1594) 
<400> 26 

gcgccacgcc acacaaaaag cagtgactga gtgagaaagg acagcaagca agccctccct 60 
ccatctccgc cccgtcgcc atg gtc ctt etc ttc gtg gag aag etc ctg gtc 112 

Met Val Leu Leu Phe Val Glu Lys Leu Leu Val 
1 5 io 



ggc etc ttg gcg tec gtc atg gtc gee ate gcg gtg tec aag ate cgt 

Gly Leu Leu Ala Ser Val Met Val Ala He Ala Val Ser Lys He Ara 

15 20 25 

ggc cgc aag etc egg ctg cct ccc ggc ccc gtc ccc gtg ccc gtc ttc 

Gly Arg Lys Leu Arg Leu Pro Pro Gly Pro Val Pro Val Pro Val Phe 

30 35 40 



160 



208 



ggg aac tgg ctg cag gtc ggc gac gac etc aac cac cgc aac etc gee 256 
Gly Asn Trp Leu Gin Val Gly Asp Asp Leu Asn His Arg Asn Leu Ala 
45 50 55 

gcg ctg tec cgc aag ttc ggc gac gtc ttc etc etc egg atg ggg cag 3 04 

Ala Leu Ser Arg Lys Phe Gly Asp Val Phe Leu Leu Arg Met Gly Gin 
60 65 70 75 

cgc aac ctg gtg gtg gtc teg teg ccg ccg ctg gcg egg gag gtg etc 352 
Arg Asn Leu Val Val Val Ser Ser Pro Pro Leu Ala Arg Glu Val Leu 
80 85 90 

cac acg cag ggc gtg gag ttc ggc tec cgc ace cgc aac gtg gtc ttc 400 
His Thr Gin Gly Val Glu Phe Gly Ser Arg Thr Arg Asn Val Val Phe 
95 100 105 

gac ate ttc acg gac aag ggg cag gac atg gtg ttc ace gtg tac ggc 448 
Asp He Phe Thr Asp Lys Gly Gin Asp Met Val Phe Thr Val Tyr Gly 
110 115 120 

gac cac tgg cgc aag atg cgc cgc ate atg acc gtg ccc ttc ttc ace 496 
Asp His Trp Arg Lys Met Arg Arg He Met Thr Val Pro Phe Phe Thr 
125 130 135 

aac aag gtc gtg cag cag tac cgc cac ggc tgg gag gee gag gee gee 544 
Asn Lys Val Val Gin Gin Tyr Arg His Gly Trp Glu Ala Glu Ala Ala 
140 145 150 155 
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gcc gtc gtc gac gac gtg cgc etc gac ccc aag gcg gec acc gac gga 592 

Ala Val Val Asp Asp Val Arg Leu Asp Pro Lys Ala Ala Thr Asp Gly 
160 165 170 

ate gtg etc cgc cga cgc ctg cag etc atg atg tac aac aac gta tac 640 
lie Val Leu Arg Arg Arg Leu Gin Leu Met Met Tyr Asn Asn Val Tyr 
175 180 185 

egg ate atg ttc gac egg cgc ttc gag age atg gac gac ccg etc ttc 688 
Arg lie Met Phe Asp Arg Arg Phe Glu Ser Met Asp Asp Pro Leu Phe 
190 195 200 

etc cgc etc agg gcg etc aac ggc gag cgc age cgc etc gcg cag age 736 
Leu Arg Leu Arg Ala Leu Asn Gly Glu Arg Ser Arg Leu Ala Gin Ser 
205 210 215 

ttc gag tac aac tac ggc gac ttc ate ccc ate etc cgt ccg ttc etc 784 
Phe Glu Tyr Asn Tyr Gly Asp Phe lie Pro He Leu Arg Pro Phe Leu 
220 225 230 235 

cgc ggc tac etc agg gtc tgc aag gag gtc aag gag acc cgc etc aag 832 
Arg Gly Tyr Leu Arg Val Cys Lys Glu Val Lys Glu Thr Arg Leu Lys 
240 245 250 

etc ttc aag gat ttc ttc etc gag gag agg aag aag ctg gcg age acc 880 
Leu Phe Lys Asp Phe Phe Leu Glu Glu Arg Lys Lys Leu Ala Ser Thr 
255 260 265 

aag gcc acg gac age aac ggc etc aag tgc gcc att gat cac ata ctg 928 
Lys Ala Thr Asp Ser Asn Gly Leu Lys Cys Ala He Asp His He Leu 
270 275 280 

gag gca cag cag aag ggt gag ate aac gag gac aac gtg etc ttc ate 976 
Glu Ala Gin Gin Lys Gly Glu He Asn Glu Asp Asn Val Leu Phe He 
285 290 295 

gtc gag aac att aac gtt gca gcg ate gag acc acg ctg tgg teg ate 1024 
Val Glu Asn He Asn Val Ala Ala He Glu Thr Thr Leu Trp Ser He 
300 305 310 315 

gag tgg gcg gtc get gag ctg gtg aac cac ccg gag ate cag cag aag 1072 
Glu Trp Ala Val Ala Glu Leu Val Asn His Pro Glu He Gin Gin Lys 
320 325 330 

ctg egg cag gag ctg gac acg gtg etc ggg ccg ggc cac cag ate acg 1120 
Leu Arg Gin Glu Leu Asp Thr Val Leu Gly Pro Gly His Gin He Thr 
335 340 345 

gag ccg gac acg cac aac etc ccc tac ctg cag gcg gtg ate aag gag 1168 
Glu Pro Asp Thr His Asn Leu Pro Tyr Leu Gin Ala Val He Lys Glu 
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350 355 360 

acg ctg egg ctg egg atg gec ate ccg ctg ctg gtg ccg cac atg aac 1216 
Thr Leu Arg Leu Arg Met Ala lie Pro Leu Leu Val Pro His Met Asn 
365 370 375 

etc cac gac gee aag etc ggc ggn tac gac ate ccc gee gag age aag 1264 
Leu His Asp Ala Lys Leu Gly Xaa Tyr Asp lie Pro Ala Glu Ser Lys 
380 385 390 395 

ate etc gtc aac gee tgg tac etc gee aac aac ccc gac agy tgg agg 1312 
lie Leu Val Asn Ala Trp Tyr Leu Ala Asn Asn Pro Asp Xaa Trp Arg 
400 405 410 

egg ccc gag gag ttc egg ccc gag cga ttc ytc gag gag gag aag cac 1360 
Arg Pro Glu Glu Phe Arg Pro Glu Arg Phe Xaa Glu Glu Glu Lys His 
415 420 425 

gtc gag gec aac ggc aac gac ttc agg tac ctg ccc ttc ggc gtc ggc 1408 
Val Glu Ala Asn Gly Asn Asp Phe Arg Tyr Leu Pro Phe Gly Val Gly 
430 435 440 

cgc agg age tgc ccc ggg ate ate etc gec ctg ccc ate etc ggc ate 1456 
Arg Arg Ser Cys Pro Gly lie lie Leu Ala Leu Pro lie Leu Gly lie 
445 450 455 

acc ate ggt cgc etc gtc cag aac ttc gag ctg ctg ccg ccg ccc ggg 1504 
Thr lie Gly Arg Leu Val Gin Asn Phe Glu Leu Leu Pro Pro Pro Gly 
460 465 470 475 

cag gac aag gtn gac acc acc gag aag gga ggc cag ttc agt etc cac 1552 
Gin Asp Lys Xaa Asp Thr Thr Glu Lys Gly Gly Gin Phe Ser Leu His 
480 485 490 

ate ttg aag cat tec acc ate gtg tgc aag cca aga acg ctt 1594 
lie Leu Lys His Ser Thr lie Val Cys Lys Pro Arg Thr Leu 
495 500 505 

taagagcagc ccacacgtcg gttccatgcg gagcagtcga atgttntgct ccatcaccat 1654 

gttattcggg cttaattaag cagtatcatt agtagacagt aggagtacag gaagaaaaaa 1714 

agctntggat aatgttattt gcaacaaagg gaagggaagc gaagaatntg ataactattc 1774 

aatgaagcgt tcgattnttg 1794 

<210> 27 

<211> 1669 

<212> DNA 

<213> Zea mays 

<220> 
<221> CDS 

<222> (53) . . . (1555) 



WO 99/10498 _ PCTAJS98/17519 

42 

<400> 27 

cgaacaaacc acacacccca cctaccccgg ccggaccggc aggcagcaca gc atg gac 58 

Met Asp 

1 

etc gec etc eta gag aag gee ctg ctg ggc ctg ttc gec gcg get gtg 106 
Leu Ala Leu Leu Glu Lys Ala Leu Leu Gly Leu Phe Ala Ala Ala Val 
5 10 15 

gtg gee ate gee gtg gec aag ctg acc ggc aag egg tac cgc etc cca 154 
Val Ala lie Ala Val Ala Lys Leu Thr Gly Lys Arg Tyr Arg Leu Pro 
20 25 30 

ccg ggg ccc ccg ggc gee ccc gtg gtg gga aac tgg ctg cag gtg ggc 202 
Pro Gly Pro Pro Gly Ala Pro Val Val Gly Asn Trp Leu Gin Val Gly 
35 40 45 50 

gac gac ctg aac cac cgc aac ctg atg gee atg gcg aag egg ttc ggc 250 
Asp Asp Leu Asn His Arg Asn Leu Met Ala Met Ala Lys Arg Phe Gly 
55 60 65 

gac ate ttc ctg ctg cgc atg ggc gtg cgc aac ctg gtg gtg gtg teg 298 
Asp He Phe Leu Leu Arg Met Gly Val Arg Asn Leu Val Val Val Ser 
70 75 80 

acc ccg gag ctg gec aag gag gtg etc cac acg cag ggc gtg gag ttc 346 
Thr Pro Glu Leu Ala Lys Glu Val Leu His Thr Gin Gly Val Glu Phe 
85 90 95 

ggc tec cgc acc cgc aac gtg gtg ttc gac ate ttc acg ggc aag ggg 394 
Gly Ser Arg Thr Arg Asn Val Val Phe Asp He Phe Thr Gly Lys Gly 
100 105 110 

cag gac atg gtg ttc acg gtg tac ggc gac cac tgg cgc aag atg egg 442 
Gin Asp Met Val Phe Thr Val Tyr Gly Asp His Trp Arg Lys Met Arg 
II 5 120 125 ' 130 

cgc ate atg acc gtc ccc ttc ttc acc aac aag gtg gtg gec cag aac 490 
Arg He Met Thr Val Pro Phe Phe Thr Asn Lys Val Val Ala Gin Asn 
135 140 145 

cgc gec ggg tgg gag gag gag gec egg ctg gtg gtg gag gac gtg agg 53 8 

Arg Ala Gly Trp Glu Glu Glu Ala Arg Leu Val Val Glu Asp Val Arg 
150 155 160 

aag gac ccc gag gee gcg gee ggc ggc gtc gtg etc cgc cgc cgc etc 586 
Lys Asp Pro Glu Ala Ala Ala Gly Gly Val Val Leu Arg Arg Arg Leu 
165 170 175 

cag ctg atg atg tac aac gac atg ttc cgc ate atg ttc gac cgc egg 634 
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Gin Leu Met Met Tyr Asn Asp Met Phe Arg lie Met Phe Asp Arg Arg 

180 185 X90 

ttc gac age gag cac gac ccg etc ttc aac aag etc aag gca etc aac sn, 
Phe Asp Ser Glu His Asp Pre Leu Phe Asn LyI Leu Lyl 111 Leu As"n 
195 200 205 * 210 

?? 9 ? gC 2 gc ? gc ctg tcg ca ^ a 9 c ttc gag tac aac tac ggc gac 730 

Ala Glu Arg Ser Arg Leu Ser Gin Ser Phe Glu Tyr Asn Tyr Gly Asp 
215 220 225 

ttc ate ccc gtg etc cgc ccc ttc etc cgc ggc tac etc aac cac tar 7-7« 
Phe lie Pre Val Leu Arg Pro Phe Leu Arg Gly Tyr lIu Asn Arg ??8 
230 235 240 

cac gac etc aag acg cgc cgc atg aag gtc ttc gag gac aac ttc ata 826 
His Asp Leu Lys Thr Arg Arg Met Lys Val Phe Glu Asp A^n Phe Val 
245 250 255 

cag gag cgc aag aag gtg atg get cag act ggt gag ate egg tgc gec 874 
Gin Glu Arg Lys Lys Val Met Ala Gin Thr Gly Glu lie Arg Cys Ala 
260 265 270 

m~ 9 ? at ?* C f tC gag gcc gag agg aaa ^gc gag ate aac cac gac 922 
Met Asp His He Leu Glu Ala Glu Arg Lys Gly Glu He Asn His lip 
275 280 285 290 

aac gtc etc tac ate gtc gag aac ate aac gtc gca acq ate aaa aca q 7n 
Asn val Leu Tyr He Val Glu Asn lie Asn Val Ala Til He III Tnr 
295 300 



305 



335 



1018 



Sk 9 ? 9 £ gg 5 Cg atC gag tgg ggc atc 9cc gag ctg gtg aac cac ccg 
Thr Leu Trp Ser He Glu Trp Gly He Ala Glu Leu Val Asn His Pro 
310 315 320 

f?» ?To r? 9 . f 39 ?*° cgg gag gag otc ^cc teg gtg ctg ggc gcc 1066 

Ala He Gin His Lys Leu Arg Glu Glu Leu Ala Ser Val Leu Gly Ala 
325 330 



<£? SS« Z 9 Sk 9 939 CCg gac c * c gag cgc ctc ccc tac ctt cag 1114 
Gly val Pro Val Thr Glu Pro Asp Leu Glu Arg Leu Pro Tyr Leu Gin 

340 345 350 

gcc atc gtc aag gag acg ctc cgc ctg cgc atg gcc atc ccg ctg ctg lie 2 

Ala He val Lys Glu Thr Leu Arg Leu Arg Met Ala He Pro Leu Leu 
355 360 365 370 

Si C S a ° 2*2 aac ctc aac gac «gc aag ctc gcc ggc ttc gac atc 1210 
Val Pro His Met Asn Leu Asn Asp Gly Lys Leu Ala Gly Phe Asp lie 
375 380 385 
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ccc gcc gag tec aag ate etc gtc aat gee tgg ttc etc gee aac gac 1258 
Pro Ala Glu Ser Lys lie Leu Val Asn Ala Trp Phe Leu Ala Asn Asp 
390 395 400 

ccc aag agg tgg gtg cgc ccc gac gag ttc egg ccc gag cgc ttc ctg i 30 6 
Pro Lys Arg Trp Val Arg Pro Asp Glu Phe Arg Pro Glu Arg Phe Leu 
405 410 415 

gag gag gag aag tec gtg gag gcc cac ggc aac gac ttc cga ttc gtg 1354 
Glu Glu Glu Lys ser Val Glu Ala His Gly Asn Asp Phe Arg Phe Val 
420 425 430 

S C ° £5* 2? g , gt f ggc cgc cgg agc tgc cct 9<3g ate ate etc gcg ctg 1402 
Pro Phe Gly Val Gly Arg Arg Ser Cys Pro Gly He He Leu 111 Leu 

435 440 445 450 

£™ 2 9C a w° ° tg ggC Cgg Ctg gtg cag aac ttc ctg 1450 

Pro He He Gly He Thr Leu Gly Arg Leu Val Gin Asn Phe Gin Leu 

455 460 465 

ctg ccg ccg ccg ggg ctg gac aag ate gac ace acg gag aag ccc ggc 1493 
Leu Pro Pro Pro Gly Leu Asp Lys He Asp Thr Thr Glu LyI Pro Glv 
470 475 480 

C3g l gC aaC Cag atc gcc aag cat gcc acc atc gtc tgc aag cce 1546 
Gin Phe Ser Asn Gin He Ala Lys His Ala Thr He Val Cys Lys Pro 
4 »5 490 495 



etc gag gcc tagaaatcaa tgcgtgtttc ctgcacgcgc ccccgcagat 
Leu Glu Ala ^ 
500 

tatttcttS tcgr ttCtC ttttttttgt gtgttgtgtt ttttttacta agaggagatg 1655 



<210> 28 
<211> 1356 
<212> DNA 
<213> zea mays 

<220> 
<221> CDS 

<222> (135) ... (1244) 



<400> 28 

tcgacccacg cgtccgccgc caategcagt atcggcaccg ttcttccccg cttctccagt fic 
t^SZlll agccgccccc cctccgctgc tgacactgcg aagtgcgaat caaagccacc 120 
accgcgcaea aacc atg gca ccg gtg gag gcg gag cag cae cgg cgg agg 170 
Met Ala Pro Val Glu Ala Glu Gin His Arg Arg Arg 

1 5 1 A 



5 10 
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III til Til ZtZ lit Til s 22 ^ cc g ? c gcc gtc tcc ccc * tc 

i. e u A ia j.eu Ala Ala Has Asp Ala Ser Gly Ala Val Ser Pro lie 

20 25 

SS ffi IS? 2; 2S SS S a; S P SS 25 S3 K 52 32 52 

a ss 3s i5 a a ss s ss a ss s ss a ss as 

*»° 55 60 

S 52 SS S SS 5? IS 51 SS SS 15 SS IS 52 K 15 

70 75 

52 S S IS 5! 15 51 S3 51 SS S SS 2f 15 SS 
51 ?S 15 SS 25 52 S3 SS SS S 52 2!? 52 52 IS 121 

100 105 

tgc gag gga ggg cac gag aac tac tac tcc aaa a fr af „ 
cy S Glu Gly Gly His «iS Asn T yr c£ Ser Lys !£ Jg J£ Thr gr 

115 120 

SS 111 S 25 SI |5 15 SS 51 S 51 15 25 51 51 SS 
51 S2 SS 21 SI SS SI 52 5! 52 SI 52 15 Sp 25 2? 

150 155 

HI 51 SI 15 25 S2 122 52 51 51 111 aj ffi S 5! 51 

165 170 

SI SS 52 52 SS 32 25 25 SS SI 25 25 52 32 52 SS 

180 185 

51 51 25 52 1?? 15 22 15 31 51 SS 51 52 51 111 52 

195 200 

S SS 15 52 22 S3 S 5! 5S 122 5? 52 122 25 SS IS 

215 oon 



218 



266 



314 



362 



410 



458 



506 



554 



602 



650 



698 



746 



794 



842 



890 



938 



986 



1034 
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rrg gaa get atg gag acg ctt ggt gca gac gec ttt gtt gtc age ggt 

Xaa Glu Ala Met Glu Thr Leu Gly Ala Asp Ala Phe Val Val sir Glv 
225 230 235 

gat get aac cag atg aag get gcg aag ggc aca atg gat ggc att atq 
Asp Ala Asn Gin Met Lys Ala Ala Lys Gly Thr Met Asp Gly lie Met 
240 245 250 

aac acg gee tct gca age atg tec atg tac get tac ctt get etc etc 
Asn Thr Ala Ser Ala Ser Met Ser Met Tyr Ala Tyr Leu Ala Leu Leu 
255 260 265 

aag ccc cag ggc aag atg ate ctg ctt ggc ctg cct gag aag cct ctg 

LyS ,™ Gln Gly Lys Met Ile Leu Leu G1 V Leu Pr ° Glu Lys Pro Leu 
270 275 280 

cag ate tct gee ttc tct ttg gtt act ggg ggc aag act ctg gee ggg 
Gin Ile ser Ala Phe Ser Leu Val Thr Gly Gly Lys Thr Leu Ala Gly 
285 290 295 300 

age tgc atg ggg age ate agg gac acg cag gag atg atq aac ttc aca m«? 
Ser cys Met Gly Ser lie Arg Asp Thr Gin llu Met Met As" P Pne III 
305 310 315 

gee aag cac ggg ttg gca gcg gac ate gaa ctg ate ggc ace gaa gaa 1130 
Ala Lys His Gly Leu Ala Ala Asp He Glu Leu He Gly Thr Glu Glu 
320 325 330 

gtt aat gag gee atg gaa cgc etc gee aag ggc gag gtc aaa tac cac n 7« 

Val Asn Glu Ala Met Glu Arg Leu Ala LyI Gly Glu Val Arg Tyr trg 
335 340 345 

ttc gtc ate gac ate ggc aac acc etc aac gcg gca tea eta aaa aac 
Phe Val lie Asp lie Gly Asn Thr Leu Asn III Ala Ser Leu lly sir 
350 355 360 

se? Pro £l Pro £j Leu tagCtgCggc ^tgttgat caaeaaatgc i 2 74 
365 370 

aaaaggaact £ES2£ £<*cgatat ategtgegat aagcaagtat atttggaata 1334 

<210> 29 
<211> 1465 
<212> DNA 
<213> Zea mays 

<220> 
<221> CDS 



WO 99/10498 . PCT/US98/17519 

47 

<222> (160) ... (1236) 
<400> 29 

ccacgcgtcc gcgtcctcgt ccccgttcca tttcattatc cccccggagc tggtgcgcga 60 
gtcggagctg gtgcagggtc acccttcccc tcggcctcaa gagctctgcg gttgccgcgg 120 
ccaagggcgt ccgtggagaa gcgggagcag gtggcggcg atg gaa gag caa ggc 174 

Met Glu Glu Gin Gly 
1 5 

ggc cag gcg gcg etc ggg tgg gcg gec agg gac gac tec ggc gtc etc 222 
Gly Gin Ala Ala Leu Gly Trp Ala Ala Arg Asp Asp Ser Gly Val Leu 
10 15 20 

tec ccc tac age ttc tec aga agg gtt cct aaa gac gac gat gtc acq 270 
Ser Pro Tyr Ser Phe Ser Arg Arg Val Pro Lys Asp Asp Asp Val Thr 
25 30 * 35 

ate aag gtg etc tac tgc ggg ate tgc cac acc gac ctg cac gtc ate 318 
He Lys Val Leu Tyr Cys Gly He Cys His Thr Asp Leu His Val He 
40 45 50 

aag aac gac tgg cga aac gec atg tac cca gtc gtc ccg ggg cac aaa 366 
Lys Asn Asp Trp Arg Asn Ala Met Tyr Pro Val Val Pro Gly His Glu 
55 60 65 

ate gtg ggc gtt gtg acc ggc gtc ggc ggc ggc gtc acg egg ttc aaa 414 
lie Val Gly Val Val Thr Gly Val Gly Gly Gly Val Th? A?g Phe Lyl 
70 75 80 85 

gee ggc gac acg gtc ggc gtg ggc tac ttc gtg ggg tec tgc cgc tec 462 
Ala Gly Asp Thr Val Gly Val Gly Tyr Phe Val Gly Ser Cys Arg Ser 
90 95 100 

tgc gac age tgc ggc aag ggg gac gac aac tac tgc gcg ggg ate gtq 510 
Cys Asp Ser Cys Gly Lys Gly Asp Asp Asn Tyr Cys Ala Gly He Val 
105 110 115 

etc acc tec aac ggc gtc gac cac gcg cac ggc ggc gcg ccc acc aga 558 
Leu Thr Ser Asn Gly Val Asp His Ala His Gly Gly All Pro Thr Arg 
120 125 130 

2? g gga 11° tCC gac 91:0 ctg gtc aca aac aa 9 cac tac gtg gtc cgc 606 
Gly Gly Phe Ser Asp Val Leu Val Ala Ser Glu His Tyr Val Val Ara 
135 140 145 

gtc ccc gac ggc ctg gcg ctg gac cgc acc gcg ccg ctg etc tgc gee 654 
Val Pro Asp Gly Leu Ala Leu Asp Arg Thr Ala Pro Leu Leu Cys Ala 
150 155 160 165 

2?° SP? tZ C 2*? taC agc ccc atg atg cac cac 99C etc aac gag ccc 702 
Gly Val Thr Val Tyr Ser Pro Met Met Arg His Gly Leu Asn Glu Pro 
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170 175 180 

ggc aag cac teg gcg ttc gtc ggc etc ggc ggc etc ggc cac gtc gee 750 
Gly Lys His Ser Ala Phe Val Gly Leu Gly Gly Leu Gly His Val Ala 
185 190 195 

gtc aag ttc ggc aag gec ttc ggg atg aag gtc acc gtc ate age acg 798 
Val Lys Phe Gly Lys Ala Phe Gly Met Lys Val Thr Val lie Ser Thr 
200 205 210 

tec gee age aag cgc cag gag gec ate gag aac etc ggc gcg gac gag 84 6 

Ser Ala Ser Lys Arg Gin Glu Ala lie Glu Asn Leu Gly Ala Asp Glu 
215 220 225 

ttc etc ate age egg gac gag gac cag atg aag gcg gcg acg ggg acc 894 
Phe Leu lie Ser Arg Asp Glu Asp Gin Met Lys Ala Ala Thr Gly Thr 
230 235 240 245 

atg gac ggc ate ate gac acg gtg teg gcg tgg cac ccg ate acg ccg 942 
Met Asp Gly lie lie Asp Thr Val Ser Ala Trp His Pro lie Thr Pro 
250 255 260 

ctg ctg gcg ctg ctg aag ccg ctg ggg cag atg gtg gtc gtg ggc gcg 990 
Leu Leu Ala Leu Leu Lys Pro Leu Gly Gin Met Val Val Val Gly Ala 
265 270 275 

ccg age aag ccg etc gag ctg ccg gee tac gec ate gtg ccg ggc ggg 1038 
Pro Ser Lys Pro Leu Glu Leu Pro Ala Tyr Ala lie Val Pro Gly Gly 
280 285 290 

aag ggc gtg get ggg aac aat gtc ggc age gtc agg gac tgc cag gee 1086 
Lys Gly Val Ala Gly Asn Asn Val Gly Ser Val Arg Asp Cys Gin Ala 
295 300 305 

atg etc gag ttc gcg ggg aag cac ggc ate ggg gec gag gtc gag gtc 1134 
Met Leu Glu Phe Ala Gly Lys His Gly lie Gly Ala Glu Val Glu Val 
310 315 320 325 

ate aag atg gac tac gtc aac acg gee atg gag egg etc gag aag aac 1182 
lie Lys Met Asp Tyr Val Asn Thr Ala Met Glu Arg Leu Glu Lys Asn 
330 335 340 

gac gtc cgc tac cgc ttc gtc ate gac gtc gee ggc age etc ggc tct 1230 
Asp Val Arg Tyr Arg Phe Val lie Asp Val Ala Gly Ser Leu Gly Ser 
345 350 355 

gee gec taggcatggc tgcaaaggtt tcaatcagag cccagccgca ataatttgtt 1286 
Ala Ala 

agctaccgaa tgaatgatgg tetaegcttg ttgatgagtt ggtgctttgt cgtggttttg 134 6 
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tggatgtaat aattcgatgt acaaataaaa aaaggggaga caaqotcrctt attccctt-arf iaha 

tttggtgaca acttgttcgt ttacaccgat ctatltctla attlgltga SSaaat? 9 itet 

<210> 30 
<211> 1316 
<212> DNA 
<213> Zea mays 

<220> 
<221> CDS 

<222> (58) . . . (1131) 
<400> 30 

ccacgcgtcc gggttgaggt tgaggaagag cagaggaatc cattccagga gtgttta atg 60 

Met 
1 

gcg gga ggc aag gaa gcg cac ggg tgg gca gcc agg gat gtc tct ggt 108 
Ala Gly Gly Lys Glu Ala His Gly Trp Ala Ala Arg Asp Val Ser Gly 
5 10 15 

cac etc tec cct tac cac ttc tea egg agg gtt cag aga crac aac aac i *a 

His Leu Ser Pro Tyr His Phe Ser Arg A?I Val Gin Irg Isp lsp Isp 6 
20 25 30 

mS C ^° aag gtg ctc ttc tgc ctt tgc cac act gac etc cac 204 

val Thr lie Lys val Leu Phe Cys Gly Leu Cys His Thr Lp Leu His 
35 40 45 

gtc ate aag aac gag ttt ggc aac gcc aag tac ccc gtc gtt ccc ggq 252 
Val lie Lys Asn Glu Phe Gly ten Ala Lys Tyr Pro Val Val Pro Gly 
50 55 60 6 5 

St« ri?, ?^ 2? c gtc gtc acc gac gtc ggc tcc gtc aca agc 300 

Hxs Glu lie Val Gly val val Thr Asp Val Gly Ser Gly Val Thr ser 
70 75 80 

ttc aag ccc ggc gac acg gtg ggc gtg ggc tac ttc gtc gac tcc tgc 34« 
Phe Lys Pro Gly Asp Thr Val Gly Val Gly Tyr Phe Val Asp Ser C?s 
85 90 95 

cgc age tgc gac agc tgc agc aag ggg tac gag age tac tgc ccg cag 396 
Arg Ser Cys Asp Ser cys Ser Lys Gly Tyr Glu Ser Tyr Cys Prl Gin 
100 105 110 

etc gtg gag acg tcc aac ggc gtg agc ctg gac gac gat gac ggc ggc 444 
Leu Val Glu Thr Ser Asn Gly Val Ser Leu Asp Asp Asp Lp Gly lly 
115 120 125 

??° aag ggc ggc ttc tcc g ac g cc ctc g* c gtc cac cag cgc 492 

Ala Thr Thr Lys Gly Gly Phe Ser Asp Ala Leu Val Val His Gin Arg 
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130 135 



140 



145 



£ ?2 S3 3! !S S S K ffl a a S 1!? E H S 

150 155 160 

3 a & E II? ?2 s S3 S ffi a K a S & IS 

a sh a is 5! e a a a a is a is is a 

185 190 

is a a a a a a is a a a is a a a a 
a a a a a a is a a a a a a is a a 



220 



540 



588 



636 



684 



732 



780 



is a a a a a a s a a a is sg a a a 

235 240 

gcg gcg ggc acc ttg gac oac ate ate aar ar« < 

Ala Ala «i y Thr Le * Lp g y ?£ Jg g Jg ?g K 2? ffi 

250 255 

a a a ss a a a a a a a a is a a a •» 

265 270 

a a is a a a a a a a a a a a a a •» 

280 285 

a a is is a a a a is a a is is is a is 
a a a a a a a a a is a a is a a a 

i0 315 320 

a a a a a is a a a a a a a a a a 

33 0 335 
eta gag agg aac gat gtc agg tac cgc ttc gtt gtc gac gtc gcg ggc 



972 



1020 



1068 



1116 
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Leu Glu Arg Asn Asp Val Arg Tyr Arg Phe Val Val Asp Val Ala Gly 
340 345 350 

age aag att gga ggc taggcatcac cattcctagt gttctgtcga tcgacgtgtg 1171 
Ser Lys lie Gly Gly 
355 

atttgettet tcctcgagcg tgtcttattg ttctggttgg ageaegtacg cggccatcac 1231 

aegcaggegt ggataataaa caaggtagag tttcgggttg tgtcgtttct ggatgtatgg 1291 

tgccggtgga taataaacaa gcttg 1316 

<210> 31 

<211> 1160 

<212> DNA 

<213> Zea mays 

<220> 
<221> CDS 

<222> (167) . . . (940) 
<400> 31 

acccacgcgt ccgcgccttg ccgccgcgcg ttatataagc cgccccggca ggcaaggtcg 60 
gtcaatccag caatacccga gtacccgacg cgctagctag ttctattgee gcgcacccca 120 
gatctccagg agggactcgt tegttcaget aactacactg cacgea atg gec acc 175 

Met Ala Thr 
1 

acg gcg acc gag gcg gcg ccg gcg cag gag cag cag gec aac ggc aac 22 3 

Thr Ala Thr Glu Ala Ala Pro Ala Gin Glu Gin Gin Ala Asn Gly Asn 
5 10 15 

ggc gag cag aag acg egg cac tec gag gtc ggc cac aag age ctg etc 271 
Gly Glu Gin Lys Thr Arg His Ser Glu Val Gly His Lys Ser Leu Leu 
20 25 30 35 

aag age gac gac etc tac cag tac ate ctg gac acg age gtg tac ccg 319 
Lys Ser Asp Asp Leu Tyr Gin Tyr lie Leu Asp Thr Ser Val Tyr Pro 
40 45 50 

egg gag ccg gag age atg aag gag etc cgc gag ate acc gee aag cac 367 
Arg Glu Pro Glu Ser Met Lys Glu Leu Arg Glu lie Thr Ala Lys His 
55 60 65 

cca tgg aac ctg atg acg acc tec gee gac gag ggg cag ttc ctg aac 415 
Pro Trp Asn Leu Met Thr Thr Ser Ala Asp Glu Gly Gin Phe Leu Asn 
70 75 80 

atg etc ate aag etc ate ggc gec aag aag acc atg gag ate ggc gtg 463 
Met Leu lie Lys Leu lie Gly Ala Lys Lys Thr Met Glu lie Gly Val 
85 90 95 
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tac acc ggc tac tec etc etc gcc acg gcg etc gcc etc ccg gag gac 511 
Tyr Thr Gly Tyr Ser Leu Leu Ala Thr Ala Leu Ala Leu Pro Glu Asp 
100 105 no H5 

ggc acg ate ttg gcc atg gac ate aac cgc gag aac tac gag ctg ggc 559 
Gly Thr lie Leu Ala Met Asp lie Asn Arg Glu Asn Tyr Glu Leu Gly 
120 125 130 

ctg ccc tgc ate gag aag gcc ggc gtc gcc cac aag ate gac ttc cgc 607 
Leu Pro Cys He Glu Lys Ala Gly Val Ala His Lys He Asp Phe Ara 
135 140 145 

gag ggc ccc gcg etc ccc gtc etc gac gac etc ate gcg gag gag aag 655 
Glu Gly Pro Ala Leu Pro Val Leu Asp Asp Leu He Ala Glu Glu Lys 
150 155 160 

aac cac ggg teg ttc gac ttc gtc ttc gtg gac gcc gac aag gac aac 703 
Asn His Gly Ser Phe Asp Phe Val Phe Val Asp Ala Asp Lys Asp Asn 
165 170 175 

tac etc aac tac cac gag egg ctg ctg aag ctg gtg aag ctg ggc ggc 751 
Tyr Leu Asn Tyr His Glu Arg Leu Leu Lys Leu Val Lys Leu Gly Gly 
180 185 190 195 

etc ate ggc tac gac aac acg ctg tgg aac ggc tec gtc gtg etc ccc 799 
Leu He Gly Tyr Asp Asn Thr Leu Trp Asn Gly Ser Val Val Leu Pro 
200 205 210 

gac gac gcg ccc atg cgc aag tac ate cgc ttc tac cgc gac ttc gtg 847 
Asp Asp Ala Pro Met Arg Lys Tyr He Arg Phe Tyr Arg Asp Phe Val 
215 220 225 

etc gtc etc aac aag gcg etc gcc gcc gac gac cgc gtc gag ate tgc 895 
Leu Val Leu Asn Lys Ala Leu Ala Ala Asp Asp Arg Val Glu He Cys 
230 235 240 

cag etc ccc gtc ggc gac ggc gtc acc etc tgc cgc cgc gtc aag 940 
Gin Leu Pro Val Gly Asp Gly Val Thr Leu Cys Arg Arg Val Lys 
245 250 255 

tgaaaacatg ccctggcctg ccccaccacc gccaccgacg gcgccgccgg ccgcatcctc 1000 

attccaatca taatagacga cccgcagcat taattatcca ccggcttttt ttttggctct 1060 

ttgttgcccc ctgtaatctt tctcctcctc ttcttcttgg gaattgtcgc cgccgtttcg 1120 

ataegtaaat cacgagatcg gtaatacagt aatgctcctc 1160 

<210> 32 

<211> 944 

<212> DNA 

<213> zea mays 



<220> 
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<221> CDS 

<222> (60). ..(803) 

<400> 32 

ccacgcgtcc gcttcccaca agaacatcac acggcgatag caaaaccttc cgtcggagg 59 
atg get tec gec ggc get gga gaa ggg aag gag acg get gee ggg age 107 
Met Ala Ser Ala Gly Ala Gly Glu Gly Lys Glu Thr Ala Ala Gly Ser 
15 10 15 

age etc cac age aag act etc etc aag age caa cca ctg tac cag tac 155 
Ser Leu His Ser Lys Thr Leu Leu Lys Ser Gin Pro Leu Tyr Gin Tyr 
20 25 30 

ata ctg gaa tec acc gtc ttc cca cgc gag ccg gac tgc ctg egg gag 20 3 

lie Leu Glu Ser Thr Val Phe Pro Arg Glu Pro Asp Cys Leu Arg Glu 
35 40 45 

etc cgc gtc gee acc gee acc cac ccc atg gcg ggc atg get gcg teg 251 
Leu Arg Val Ala Thr Ala Thr His Pro Met Ala Gly Met Ala Ala Ser 
50 55 60 

ccg gac gag gtg cag ctg ctg cag etc ctg ate gag att ctt ggc gee 299 
Pro Asp Glu Val Gin Leu Leu Gin Leu Leu lie Glu lie Leu Gly Ala 
65 70 75 80 

aag aac gee ate gag gtt ggc gtc ttc acc ggg tac teg ctg etc gee 347 
Lys Asn Ala lie Glu Val Gly Val Phe Thr Gly Tyr Ser Leu Leu Ala 
85 90 95 

acc gee etc gee etc ccc gac gac ggc aag att gtg gee ate gac gtt 395 
Thr Ala Leu Ala Leu Pro Asp Asp Gly Lys lie Val Ala lie Asp Val 
100 105 110 

acc cgc gag age tac gac cag ata ggg teg ccg gtg ate gag aag gee 443 
Thr Arg Glu Ser Tyr Asp Gin lie Gly Ser Pro Val lie Glu Lys Ala 
115 120 125 

ggc gtg gcg cac aag ate gac ttc cgc gtc ggg etc gcg ctg ccc gtg 491 
Gly Val Ala His Lys lie Asp Phe Arg Val Gly Leu Ala Leu Pro Val 
130 135 140 

ctg gac cag atg gtg gee gag gag ggg aac aag ggc aag ttc gac ttc 539 
Leu Asp Gin Met Val Ala Glu Glu Gly Asn Lys Gly Lys Phe Asp Phe 
145 150 155 160 

gcg ttc gtg gac gcg gac aag gtg aac ttc etc aac tac cac gag egg 587 
Ala Phe Val Asp Ala Asp Lys Val Asn Phe Leu Asn Tyr His Glu Arg 
165 170 175 

ctg ctg cag ctg etc agg gtc ggg ggc etc ate gee tac gac aac acg 635 
Leu Leu Gin Leu Leu Arg Val Gly Gly Leu lie Ala Tyr Asp Asn Thr 
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180 185 190 

ctg tgg ggc ggc tec gtg gec gcg tec ccc gac gag ccg etc tec gag 683 
Leu Trp Gly Gly Ser Val Ala Ala Ser Pro Asp Glu Pro Leu Ser Glu 
195 200 205 

egg gac cgc gcg etc get gcg gee ace agg gag ttc aac gcg gee gtg 731 
Arg Asp Arg Ala Leu Ala Ala Ala Thr Arg Glu Phe Asn Ala Ala Val 
210 ^ 215 220 

gee gee gat ccc cgc gtt cac gtc tgc cag gtc gee ate gee gac ggg 779 
Ala Ala Asp Pro Arg Val His Val Cys Gin Val Ala lie Ala Asp Gly 
225 230 235 240 

etc acg ctg tgc cgc cgc gtc gee tgatcegtat ccggttatcc gectcgaaat 833 
Leu Thr Leu Cys Arg Arg Val Ala 
245 

acagcagagc tgtgggctgt cgctgacact gctgtgagct ctgtgcttga aatggccatg 893 
gtctgtaata acgaactggg ettgagegaa aataaatcca ccagcgcgct a 944 

<210> 33 
<211> 1003 
<212> DNA 
<213> Zea mays 

<220> 

<221> CDS 

<222> (55). ..(798) 

<400> 33 

acagctagca ccaccacctt gcaccgcacc cgcaccgaga cgaacagatc gacc atg 57 

Met 
1 

get gee ggc ggc gac gac ace ace ate gcg cag gtc cac age ggc ate 105 
Ala Ala Gly Gly Asp Asp Thr Thr lie Ala Gin Val His Ser Gly lie 
5 10 15 

gac age age aac aag acg ctg etc aag age gag gee etc tac aag tac 153 
Asp Ser Ser Asn Lys Thr Leu Leu Lys Ser Glu Ala Leu Tyr Lys Tyr 
20 25 30 

gtg ctg gac acg teg gtg ctg ccg cac gag ccg gag age atg cgt gag 201 
Val Leu Asp Thr Ser Val Leu Pro His Glu Pro Glu Ser Met Arg Glu 
35 40 45 

ctg egg ctg gtg ace gac aag cac gag tgg ggg ttc atg cag teg tec 249 
Leu Arg Leu Val Thr Asp Lys His Glu Trp Gly Phe Met Gin Ser Ser 
50 55 60 65 
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ccg gac gag gcg teg ctg ctg egg atg ctg ate aag ctg age ggc gcg 297 

Pro Asp Glu Ala Ser Leu Leu Arg Met Leu lie Lys Leu Ser Gly Ala 
70 75 80 

egg egg acg ctg gag gtg ggc gtg ttc acg ggc tac teg ctg ctg gcg 345 
Arg Arg Thr Leu Glu Val Gly Val Phe Thr Gly Tyr Ser Leu Leu Ala 
85 90 95 

acg get ctg gcg ctg ccc gec gac ggc aag gtc ate gca ttc gac gtg 393 
Thr Ala Leu Ala Leu Pro Ala Asp Gly Lys Val lie Ala Phe Asp Val 
100 105 110 

age cgc gag tac tac gac ate ggc cgc ccc ttc ate gag cgc gee ggg 441 
Ser Arg Glu Tyr Tyr Asp lie Gly Arg Pro Phe lie Glu Arg Ala Gly 
115 120 125 

gtg gcg ggc aag gtg gac ttc egg gag ggc ccg gcg ctg gag cag ctg 489 
Val Ala Gly Lys Val Asp Phe Arg Glu Gly Pro Ala Leu Glu Gin Leu 
130 135 140 145 

gac gag etc etc gee gac ccg gee aac cac ggc gee ttc gac ttc gee 537 
Asp Glu Leu Leu Ala Asp Pro Ala Asn His Gly Ala Phe Asp Phe Ala 
150 155 160 

ttc gtc gac gee gac aag cct aac tac gtc egg tac cac gag cag ctg 585 
Phe Val Asp Ala Asp Lys Pro Asn Tyr Val Arg Tyr His Glu Gin Leu 
165 170 175 

etc cgc ctg gtg cgc gtc ggg ggt ace gtc gtg tac gac aac acg ctg 633 
Leu Arg Leu Val Arg Val Gly Gly Thr Val Val Tyr Asp Asn Thr Leu 
180 185 190 

tgg gee ggt act gtg gcg ctt ccc ccc gac gcg ccg etc age gac etc 681 
Trp Ala Gly Thr Val Ala Leu Pro Pro Asp Ala Pro Leu Ser Asp Leu 
195 200 205 

gac cgc agg ttc tec gee gee ate agg gaa etc aac gtc egg ctt tct 729 
Asp Arg Arg Phe Ser Ala Ala lie Arg Glu Leu Asn Val Arg Leu Ser 
210 215 220 225 

cag gat ccc cgc gtc gag gtc tgc cag etc gee ate gec gac ggc gtc 777 
Gin Asp Pro Arg Val Glu Val Cys Gin Leu Ala lie Ala Asp Gly Val 
230 235 240 

ace ate tgc cgc cgc gtc gtc tgatgtgatg atgatccgac gaccaagatc 828 
Thr lie Cys Arg Arg Val Val 
245 

atatatcatt cgctcgtcgt ctctgtcatc tttcaactgc ctgcccgccg ctgtccgctg 888 

ccgtcgtcaa ttaataatgc atggttcttg ttcttttttt ttttgtactt gcactgtgtg 948 

tgttgagttg aacatccggc gatgtactgc aacaactgga atgeaatgea acaaa 1003 
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<210> 34 
<211> 1559 
<212> DNA 
<213> Zea mays 

<220> 
<221> CDS 

<222> (231) ... (1343) 
<400> 34 

gtcgacccac gcgtccggga agagagaatc ctaccaaacc tagctaccaa ctcgatcgtc 60 
gtcatcacgc tcgaccgcac aactgcacca aggggggagg agacctaaaa actactacat 120 
cttttagcta cacatctagc taaagatcga gaggggtaaa taaggacgag cgggcgcgag 180 
ctagaagagc agctgcaggt actaccatca tcgtcgtcgt cgtcgccagg atg acc 23 6 

Met Thr 
1 

gtc gtc gac gcc gtc gtc tec tec acc gat gec ggc gec cct get gec 284 
Val Val Asp Ala Val Val Ser Ser Thr Asp Ala Gly Ala Pro Ala Ala 
5 10 15 

gcc gcc acc gcg gta ccg gcg ggg aac ggg cag acc gtg tgc gtg acc 332 
Ala Ala Thr Ala val Pro Ala Gly Asn Gly Gin Thr Val Cys Val Thr 
20 25 30 

ggc gcg gcc ggg tac ate gcc teg tgg ttg gtg aag ctg ctg etc gag 380 
Gly Ala Ala Gly Tyr lie Ala Ser Trp Leu Val Lys Leu Leu Leu Glu 
35 40 45 50 

aag gga tac act gtg aag ggc acc gtc agg aac cca gat gac ccg aag 428 
Lys Gly Tyr Thr Val Lys Gly Thr Val Arg Asn Pro Asp Asp Pro Lys 
55 60 65 

aac gcg cac etc aag gcg ctg gac ggc gcc gcc gag egg ctg ate etc 476 
Asn Ala His Leu Lys Ala Leu Asp Gly Ala Ala Glu Arg Leu lie Leu 
70 75 80 

tgc aag gcc gat ctg ctg gac tac gac gcc ate tgc cgc gcc gtg cag 524 
Cys Lys Ala Asp Leu Leu Asp Tyr Asp Ala lie Cys Arg Ala Val Gin 
85 90 95 

ggc tgc cag ggc gtc ttc cac acc gcc tec ccc gtc acc gac gac ccg 57 2 

Gly Cys Gin Gly Val Phe His Thr Ala Ser Pro Val Thr Asp Asp Pro 
100 105 no 

gag caa atg gtg gag ccg gcg gtg cgc ggc acc gag tac gtg ate aac 620 
Glu Gin Met Val Glu Pro Ala Val Arg Gly Thr Glu Tyr Val lie Asn 
115 120 125 130 

gcg gcg gcg gat gcc ggc acg gtg egg egg gtg gtg ttc acg teg tec 668 
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Ala Ala Ala Asp Ala Gly Thr Val Arg Arg Val Val Phe Thr Ser Ser 
135 140 145 

ate ggc gec gtg acc atg gac ccc aag cgc ggg ccc gac gtc gtg gtc 716 
lie Gly Ala Val Thr Met Asp Pro Lys Arg Gly Pro Asp Val Val Val 
150 155 160 

gac gag teg tgc tgg age gac etc gag ttc tgc gag aaa acc agg aac 764 
Asp Glu Ser Cys Trp Ser Asp Leu Glu Phe Cys Glu Lys Thr Arg Asn 
165 170 175 

tgg tac tgc tac ggc aag gcg gtg gcg gaa cag gcg gcg tgg gag acg 812 
Trp Tyr Cys Tyr Gly Lys Ala Val Ala Glu Gin Ala Ala Trp Glu Thr 
180 185 190 

gee egg egg egg ggc gtg gac ctg gtg gtg gtg aac ccc gtg ctg gtg 860 
Ala Arg Arg Arg Gly Val Asp Leu Val Val Val Asn Pro Val Leu Val 
195 200 205 210 

gtg ggc ccc ctg ctg cag gcg acg gtg aac gee age ate gcg cac ate 908 
Val Gly Pro Leu Leu Gin Ala Thr Val Asn Ala Ser lie Ala His lie 
215 220 225 

etc aag tac ctg gac ggc teg gee cgc acc ttc gee aac gee gtg cag 956 
Leu Lys Tyr Leu Asp Gly Ser Ala Arg Thr Phe Ala Asn Ala Val Gin 
230 235 240 

gcg tac gtg gac gtg cgc gac gtg gee gac gcg cac etc cgc gtc ttc 1004 
Ala Tyr Val Asp Val Arg Asp Val Ala Asp Ala His Leu Arg Val Phe 
245 250 255 

gag age ccc cgc gcg tec ggc cgc can etc tgc gee gag cgc gtc etc 1052 
Glu Ser Pro Arg Ala Ser Gly Arg Xaa Leu Cys Ala Glu Arg Val Leu 
260 265 270 

cac cgc gag gac gtc gtc cgc ate etc gee aag etc ttc ccc gag tac 11 00 

His Arg Glu Asp Val Val Arg He Leu Ala Lys Leu Phe Pro Glu Tyr 
275 280 285 290 

ccc gtc cca gee agg tgc tec gac gag gtg aat ccg egg aag cag ccg 1148 
Pro Val Pro Ala Arg Cys Ser Asp Glu Val Asn Pro Arg Lys Gin Pro 
295 300 305 

tac aag ttc tec aac cag aag etc egg gac ctg ggg ctg cag ttc egg 1196 
Tyr Lys Phe Ser Asn Gin Lys Leu Arg Asp Leu Gly Leu Gin Phe Arg 
310 315 320 

ccg gtc age cag teg ctt tac gac acg gtg aag aac etc cag gag aag 1244 
Pro Val Ser Gin Ser Leu Tyr Asp Thr Val Lys Asn Leu Gin Glu Lys 
325 330 335 
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gga cac ctg ccg gtg etc gga gag egg acg acg acg gag gec gec gac 1292 

Gly His Leu Pro Val Leu Gly Glu Arg Thr Thr Thr Glu Ala Ala Asp 

340 345 350 

aag gat gec ccc acg gec gag atg cag cag gga ggg ate gec ate cgt 1340 
Lys Asp Ala Pro Thr Ala Glu Met Gin Gin Gly Gly lie Ala lie Arg 
355 360 365 370 

gee tgagagggcg atgccacaca tgaacacaaa gcaatgttca tactgctgcc 139 3 

ctgcacctgc tgtgtaaaca ggcctgtgtt tgttctggct gatagtgatg taccctaaga 1453 

cttgtaacgt catgttcgtt cttgtgaact atagegagtg aataaaattg gttaatgttg 1513 

gatgttcaaa aaaaaaaaaa aaaaaaaaaa aaaaaagggc ggcege 1559 

<210> 35 
<211> 755 
<212> DNA 
<213> Zea mays 

<220> 

<221> CDS 

<222> (3) . . . (533) 

<400> 35 

cc acg cgt ccg gtc gtg ggc etc gac cgc aac gtg age gag teg gac 47 
Thr Arg Pro Val Val Gly Leu Asp Arg Asn Val Ser Glu Ser Asp 
15 10 15 

ctg gac agg etc ccc ttc etc agg tgc gtc ate aag gag acg etc egg 9 5 

Leu Asp Arg Leu Pro Phe Leu Arg Cys Val lie Lys Glu Thr Leu Arg 
20 25 30 

ctg cac ccg ccc ate ccg ctg etc etc cac gag acc gee gac gac tgc 143 
Leu His Pro Pro He Pro Leu Leu Leu His Glu Thr Ala Asp Asp Cys 
35 40 45 

gtc gtg gee ggg tac tec gtg ccc agg ggc tec cgc gtc atg gtc aac 191 
Val Val Ala Gly Tyr Ser Val Pro Arg Gly Ser Arg Val Met Val Asn 
50 55 60 

gtc tgg gee ate ggc cgc cac cgc gee teg tgg aag gac gee gac gcg 2 39 

Val Trp Ala He Gly Arg His Arg Ala Ser Trp Lys Asp Ala Asp Ala 
65 70 75 

ttc cgc ccg teg egg ttc gcg gcg ccc gag ggg gag gee gcg ggg etc 287 
Phe Arg Pro Ser Arg Phe Ala Ala Pro Glu Gly Glu Ala Ala Gly Leu 
80 85 90 95 

gac ttc aag ggc ggg tgc ttc gag ttc ctg ccg ttc ggg teg ggc cgc 335 
Asp Phe Lys Gly Gly Cys Phe Glu Phe Leu Pro Phe Gly Ser Gly Arg 
100 105 110 
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egg tec tgc ccc ggg atg gcg etc ggc ctg tac gcg ctg gag etc gee 383 

Arg Ser Cys Pro Gly Met Ala Leu Gly Leu Tyr Ala Leu Glu Leu Ala 
115 120 125 

gtc gee cag etc gcg cac gee ttc aac tgg teg ctg ccc gac gga atg 431 
Val Ala Gin Leu Ala His Ala Phe Asn Trp Ser Leu Pro Asp Gly Met 
130 135 140 

aag ccc teg gag atg gac atg ggc gac ate ttc ggc ctt acc gcg ccg 479 
Lys Pro Ser Glu Met Asp Met Gly Asp He Phe Gly Leu Thr Ala Pro 
145 150 155 

cgc gee acg egg etc tac gee gtg cct acg ccc egg etc aac tgc ccc 527 
Arg Ala Thr Arg Leu Tyr Ala Val Pro Thr Pro Arg Leu Asn Cys Pro 
160 " 165 170 175 

ttg tac tgacgccctg cacgtggcgc gcggggactg ccattacgca tgcatgcgtt 58 3 
Leu Tyr 

tggactttgg tgttcatccc tggggtgggg ccgccgtggg ggaagttagg agtttggtgg 64 3 

ctttctagct ctgtcttctt gtattctgtt tattataaat tttcccaacc cttccatgcc 703 

tgatcgatgt gcggtaataa ttgttagaaa atgtgacatt ttgtatgtaa tc 755 

<210> 36 
<211> 972 
<212> DNA 
<213> Zea mays 

<220> 

<221> CDS 

<222> (2)... (706) 

<400> 36 

g gca cga gat ttc ccg gac ggt ccg ccg ccc age ggc acg gcg atg teg 49 

Ala Arg Asp Phe Pro Asp Gly Pro Pro Pro Ser Gly Thr Ala Met Ser 

15 10 15 

gtg ggg acc aag etc aat aag etc age tac aac teg gtg gtg gag ate 97 
Val Gly Thr Lys Leu Asn Lys Leu Ser Tyr Asn Ser Val Val Glu He 
20 25 30 

gtg ctg cag aac ccg gcg gec gtc ccg acg gag aac cac ccg ate cac 145 
Val Leu Gin Asn Pro Ala Ala Val Pro Thr Glu Asn His Pro He His 
35 40 45 

etc cac ggc ttc aac ttc ttc gtg ctg gcg cag ggg atg ggt acc ttc 193 
Leu His Gly Phe Asn Phe Phe Val Leu Ala Gin Gly Met Gly Thr Phe 
50 55 60 

gee ccg gga age gtg gee tac aac ctg gtg gac ccg gtg gee cgc aac 241 
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Ala Pro Gly Ser Val Ala Tyr Asn Leu Val Asp Pro Val Ala Arg Asn 
65 70 75 80 

acc ate gec gtg cct ggc ggt ggc tgg get gtc ata cgc ttc gtc gec 289 
Thr lie Ala Val Pro Gly Gly Gly Trp Ala Val He Arg Phe Val Ala 
85 90 95 

aac aat cca ggc atg tgg ttc ttt cac tgc cac ctg gac ccg cac gtg 3 37 

Asn Asn Pro Gly Met Trp Phe Phe His Cys His Leu Asp Pro His Val 
100 105 110 

cct atg ggc ctg ggc atg gtg ttc cag gtg gac age ggg acg acg ccc 385 
Pro Met Gly Leu Gly Met Val Phe Gin Val Asp Ser Gly Thr Thr Pro 
115 120 125 

ggc tec acg etc cct acg ccg ccg ggg gat tgg gtg gga gta tgc gac 4 33 

Gly Ser Thr Leu Pro Thr Pro Pro Gly Asp Trp Val Gly Val Cys Asp 
130 135 140 

gcg cag cac tac gcg gee gcg gcg gcg gta gca gca gcg ccg gtg cca 481 
Ala Gin His Tyr Ala Ala Ala Ala Ala Val Ala Ala Ala Pro Val Pro 
145 150 155 160 

gtt ccg gee cca gee cca gtc cca gca cca ate eta gcg cca gca cca 529 
Val Pro Ala Pro Ala Pro Val Pro Ala Pro He Leu Ala Pro Ala Pro 
165 170 175 

gca gaa teg ccg ttg cca cct ccg cgc gcg gtg gac cac aag ccg teg 577 
Ala Glu Ser Pro Leu Pro Pro Pro Arg Ala Val Asp His Lys Pro Ser 
180 185 190 

ccc aac ctt cct cag cgc agg gag cac acg ggt acc tct aat tec get 625 
Pro Asn Leu Pro Gin Arg Arg Glu His Thr Gly Thr Ser Asn Ser Ala 
195 200 205 

get gga egg aga get aag ggg cac etc get tgt ttc ttg tgt tct gtc 673 
Ala Gly Arg Arg Ala Lys Gly His Leu Ala Cys Phe Leu Cys Ser Val 
210 215 220 

etc ctt ttc ttt ctt ctt cgt caa cac aag gee tagctcatgg gaagctttgg 7 26 
Leu Leu Phe Phe Leu Leu Arg Gin His Lys Ala 
225 230 235 

cgtgataccc gegcagaate tgcagttttg ttcgttcgtg taagtaatat gaaattgttt 786 

attctttaaa tgttttatac acgtgtatat ggacttatat tttgatgtaa ttgtgtgacc 846 

ttttccttct ccacgtgggc agttgtgcat agcaaagttc atgtttaggg tttattggct 906 

ctctgtattc atgatagaca ttgatacaaa gtaatatcat acactaeggt ttgatttgaa 966 

accccc 97 2 

<210> 37 
<211> 25 



WO 99/10498 



PCT/US98/17519 



<212> DNA 
<213> Zea mays 

<400> 37 
atgggcgacg cggccatcgc cgccg 

<210> 38 
<211> 25 
<212> DNA 
<213> Zea mays 

<400> 38 
cgctgctgcc gtcttcttga cgaac 

<210> 39 

<211> 25 

<212> DNA 

<213> Zea mays 

<400> 39 
atgggttccg tagacgcggc gatcg 

<210> 40 
<211> 25 
<212> DNA 
<213> Zea mays 

<400> 40 
ctgaacaccg gcggcgagcc tggct 

<210> 41 

<211> 25 

<212> DNA 

<213> Zea mays 

<400> 41 
atgatcacgg tggcagcacc ggagg 

<210> 42 
<211> 25 
<212> DNA 
<213> Zea mays 

<400> 42 
ggcggtcgcg gctgccgcga gcttg 

<210> 43 
<211> 25 
<212> DNA 
<213> Zea mays 
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<400> 43 
atggccaccg ccatagttcc cacgg 

<210> 44 
<211> 25 
<212> DNA 
<213> Zea mays 

<400> 44 
tgggtagacc tcgatgacac cacga 

<210> 45 

<211> 25 

<212> DNA 

<213> Zea mays 

<400> 45 
atggccctca tgcaggagag tagta 

<210> 46 
<211> 25 
<212> DNA 
<213> Zea mays 

<400> 46 
aggatagacc tcgatgatga ccgat 

<210> 47 

<211> 25 

<212> DNA 

<213> Zea mays 

<400> 47 
atggcactca tgcaagagag cagta 

<210> 48 
<211> 25 
<212> DNA 
<213> Zea mays 

<400> 48 
aggatagacc tcgatgatga ccgac 

<210> 49 
<211> 25 
<212> DNA 
<213> Zea mays 

<400> 49 
atggcactca tgcaggagag cagcc 
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<210> 50 
<211> 25 
<212> DNA 
<213> Zea mays 

<400> 50 
tggatagacc tcgatgaccg acatg 

<210> 51 
<2il> 25 
<212> DNA 
<213> Zea mays 

<400> 51 
atggtccttc tcttcgtgga gaagc 

<210> 52 
<211> 25 
<212> DNA 
<213> Zea mays 

<400> 52 
aagcgttctt ggcttgcaca cgatg 

<210> 53 

<211> 25 

<212> DNA 

<213> Zea mays 

<400> 53 
atggacctcg ccctcctaga gaagg 

<210> 54 
<211> 25 
<212> DNA 
<213> Zea mays 

<400> 54 
ggcctcgagg ggcttgcaga cgatg 

<210> 55 
<211> 25 
<212> DNA 
<213> Zea mays 

<400> 55 
atggcaccgg tggaggcgga gcagc 

<210> 56 
<211> 25 
<212> DNA 
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<213> Zea mays 

<400> 56 
cagagctggg accggcgagc tccct 

<210> 57 
<211> 25 
<212> DNA 
<213> Zea mays 

<400> 57 
atggaagagc aaggcggcca ggcgg 

<210> 58 

<211> 25 

<212> DNA 

<213> Zea mays 

<4O0> 58 
ggcggcagag ccgaggctgc cggcg 

<210> 59 
<211> 25 
<212> DNA 
<213> Zea mays 

<400> 59 
atggcgggag gcaaggaagc gcacg 

<210> 60 

<211> 25 

<212> DNA 

<213> Zea mays 

<400> 60 
gcctccaatc ttgctgcccg cgacg 

<210> 61 
<211> 25 
<212> DNA 
<213> Zea mays 

<400> 61 
atggccacca cggcgaccga ggcgg 

<210> 62 
<211> 25 
<212> DNA 
<213> Zea mays 



<400> 62 
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65 



cttgacgcgg cggcagaggg tgacg 



25 



<210> 63 
<211> 25 
<212> DNA 
<213> Zea mays 

<400> 63 

atggcttccg ccggcgctgg agaag 25 

<210> 64 
<211> 25 
<212> DNA 
<213> Zea mays 

<400> 64 

ggcgacgcgg cggcacagcg tgagc 25 

<210> 65 
<211> 25 
<212> DNA 
<213> Zea mays 

<400> 65 

atggctgccg gcggcgacga caeca 25 

<210> 66 
<211> 25 
<212> DNA 
<213> Zea mays 

<400> 66 

gaegacgegg eggcagatgg tgacg 25 

<210> 67 
<211> 25 
<212> DNA 
<213> Zea mays 

<400> 67 

atgacegteg tcgacgccgt egtet 25 

<210> 68 

<211> 25 

<212> DNA 

<213> Zea mays 

<400> 68 

ggcacggatg gcgatccctc ectge 25 



<210> 69 
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<211> 25 
<212> DNA 
<213> Zea mays 

<400> 69 
ccacgcgtcc ggtcgtgggc ctcga 

<210> 70 
<211> 25 
<212> DNA 
<213> Zea mays 

<4O0> 70 
tcaggcatgg aagggttggg aaaat 

<210> 71 

<211> 25 

<212> DNA 

<213> Zea mays 

<400> 71 
ggcacgagat ttcccggacg gtccg 

<210> 72 
<211> 25 
<212> DNA 
<213> Zea mays 

<400> 72 
ctaggccttg tgttgacgaa gaaga 

<210> 73 

<211> 526 

<212> PRT 

<213> Zea mays 



<400> 73 



Met 


Val 


Thr 


Val Ala 
5 


Lys 


He 


Ala 


Met 


1 
Ser 


Trp 


val 


Phe Leu 


Gly 


Thr 


Leu 


Ala 






20 








25 


Arg 


Arg 


Gly 


Lys Ala 


Pro 


Leu 


Pro 


Pro 






35 








40 




Val 


Gly 


Asn 


Met Ala 


Met 


Met 


Asp 


Gin 




50 








55 






Ala 


Leu 


Ala 


Glu Arg Tyr 


Gly 


Gly 


Leu 


65 








70 








Leu 


His 


Ala 


Phe Ala 


Val 


Ser 


Thr 


Pro 








85 










Gin 


Ala 


Gin Asp Gly 


Ala 


Phe 


Ser 


Asn 








100 








105 
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25 



25 



25 



25 



Glu Trp Leu Gin Asp Pro Leu 
10 15 
Leu Val Val Leu Gin Leu Arg 
30 

Gly Pro Lys Pro Leu Pro lie 
45 

Leu Thr His Arg Gly Leu Ala 
60 

Leu His Leu Arg Leu Gly Arg 

75 80 
Glu Tyr Ala Arg Glu Val Leu 
90 95 
Arg Pro Ala Thr He Ala He 
110 
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Ala 


Tyr 


Leu 


Thr 






115 




Pro 


Phe 


Trp 


Arg 




130 






Arg 


Arg 


Arg 


Ala 


145 








Leu 


Val 


Arg 


Ala 


Asn 


Leu 


Gly 


Glu 








180 


Ala 


Ala 


Pne 


Gly 






195 




lie 


Leu 


Gin 


Glu 




210 






Phe 


Leu 


Pro 


Trp 


225 








Leu Arg 


Ala 


Ala 


Asp Glu 


His 


Val 








260 


Met 


Val 


Asp 


Asp 






275 




Lys 


Gly 


Pro 


Ala 




290 






Arg 


Leu 


Thr 


Arg 


305 








Gly 


Gly 


Thr 


Glu 


Met 


Met 


His 


Ser 








340 


Asp 


Val 


Val 


Gly 






355 




Leu 


Pro 


Phe 


Leu 




370 






Pro 


He 


Pro 


Leu 


385 








Gly Tyr 


Ser 


Val 


lie 


Gly 


Arg 


His 








420 


Ser 


Arg 


rile 


inr 






435 




Gly 


Cys 


Phe 


Glu 




450 






Gly Thr 


Ala 


Leu 


465 








Ala 


His 


Gly 


Phe 


Leu 


Asp 


Met 


Gly 








500 


Leu Tyr 


Ala 


Val 



Tyr 


Asp 


Arg 


Ala 








120 


Gin 


Met 


Arg 


Lys 






135 




Glu 


Thr 


Trp 


Val 




150 






Val 


Ala 


Ser 


Gly 


165 








Leu 


lie 


Phe 


Asn 


Thr 


Arg 


Asp 


Gly 








200 


Phe 


Ser 


Lys 


Leu 






215 




Leu 


Ser 


Trp 


Met 




230 






Arg 


Ser 


Ala 


Leu 


245 








Arg 


Arg 


Gly 


Lys 


Met 


Leu 


Ala 


Phe 








280 


Ala 


Ala 


Ala 


Asp 






295 




Asp 


Asn 


He 


Lys 




310 






Thr 


Val 


Ala 


Ser 


325 








Pro 


Asp 


Asp 


Leu 


Leu 


Asp 


Arg 


Asn 








360 


Lys 


Cys 


Val 


lie 






375 




Leu 


Leu 


His 


Glu 




390 






Pro 


Arg 


Gly 


Ser 


405 








Arg 


Ala 


Ser 


Trp 


Pro 


bill 


<jr±y 


GlU 








440 


Phe 


Leu 


Pro 


Phe 






455 




Gly 


Leu 


Tyr 


Ala 




470 






Asn 


Trp 


Ser 


Leu 


485 








Asp 


Val 


Phe 


Gly 


Pro 


Thr 


Pro 


Arg 



67 



Asp 


Met 


Ala 


Phe 


Leu 


Cys 


Val 


Met 








140 


Ala 


Val 


Arg 


Asp 






155 




Gly 


Gly 


Gly 


Gly 




170 






Leu 


Thr 


Lys 


Asn 


185 








GlU 


Asp 


Gin 


Glu 


Phe 


Gly 


Ala 


Phe 








220 


Asp 


Leu 


Gin 


Gly 






235 




Asp 


Arg 


Phe 


He 




250 






Asn 


Pro 


Asp 


Asp 


265 








Phe 


Ala 


Glu 


Ala 


Gly 


Asp 


Asp 


Leu 








300 


Ala 


He 


He 


Met 






315 




Ala 


He 


Glu 


Trp 




330 






Arg 


Arg 


Leu 


Gin 


345 








Val 


Asn 


Glu 


Ser 


Lys 


Glu 


Thr 


Leu 








380 


Thr 


Ala 


Gly 


Asp 






395 




Arg 


Val 


Met 


Val 




410 






Lys 


Asp 


Ala 


Asp 


425 








ax a 


Ala 


Gly 


Leu 


Gly 


Ser 


Gly 


Arg 








460 


Leu 


Glu 


Leu 


Ala 






475 




Pro 


Asp 


Gly 


Met 




490 






Leu 


Thr 


Ala 


Pro 


505 








Leu 


Asn 


Cys 


Pro 



Ala 


His 


Tyr 


Gly 


125 








Lys 


Leu 


Phe 


Ser 


Glu 


Cys 


Ala 


Ala 








160 


Gly 


Glu 


Ala 


Val 






175 




Val 


Thr 


Phe 


Arg 




190 






Glu 


Phe 


lie 


Ala 


205 








Asn 


Val 


Val 


Asp 


He 


Asn 


Arg 


Arg 








240 


Asp 


Lys 


He 


He 






255 




Ala 


Asp 


Ala 


Asp 




270 






Lys 


Pro 


Pro 


Lys 


285 








His 


Asn 


Thr 


Leu 


Asp 


Val 


Met 


Phe 








320 


Ala 


Met 


Ala 


Glu 






335 




Gin 


Glu 


Leu 


Ala 




350 






Asp 


Leu 


Asp 


Lys 


365 








Arg 


Leu 


His 


Pro 


Cys 


Val 


Val 


Gly 








400 


Asn 


Val 


Trp 


Ala 






415 




Ala 


Phe 


Arg 


Pro 




430 






Asp 


Phe 


Lys 


Gly 


445 








Arg 


Ser 


Cys 


Pro 


Val 


Ala 


Gin 


Leu 








480 


Lys 


Pro 


Ser 


Glu 






495 




Arg 


Ala 


Thr 


Arg 




510 






Leu 


Tyr 
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515 520 





<210> 


74 












<211> 


263 












<212> 


PRT 












<213> 


Zea 


mays 








<400> 


74 










Met 


Ala 


Thr 


Thr 


Ala 


Thr 


Glu 


Ala 


1 








5 








Gin 


*j±n 


Ala 


Asn 


Gly 


Asn 


Gly 


Asn 








20 








Glu 


vai 


Qjj.y 


His 


Lys 


Ser 


Leu 


Leu 






35 










40 


lie 


T a« « 

Leu 


Asp 


Thr 


Ser 


v a j. 


Tyr 


Pro 




50 










55 




Leu 


Arg 


Glu 


He 


Thr 


Ala 


Lys 


His 


65 










70 




Ala 


Asp 


Glu 


Gly Gin 


Phe 


Leu 


Asn 










85 








Lys 


Lys 


Thr 


Met 


Glu 


He 


Gly 


Val 








100 








Thr 


Ala 


Leu 


Ala 


Leu 


Pro 


Glu 


Asp 






115 










120 


Asn 


Arg 


Glu 


Asn 


Tyr 


Glu 


Leu 


Gly 




130 










135 


val 


Gly 


His 


Lys 


He 




Phe 


Arg 


145 










-L. J \J 




Asp 


Asp 


Leu 


Val 


Ala 


Acn 
nop 


Lys 


Glu 










165 








Phe 


Val 


Asp 


Ala 


Asp 


Lys 


Asp 


Asn 








180 










Leu 


Lys 


Leu 


Val 


Arg 


Pro 


Gly 


Gly 






195 










200 


Trp 


Asn 


Gly 


Ser 


Val 


Val 


Leu 


Pro 




210 










215 




He 


Arg 


Phe 


Tyr Arg 


Asp 


Phe 


Val 


o o c 










230 






Ala 


Asp 


Asp 


Arg 


Val 


Glu 


He 


Cys 










245 






Thr 


Leu 


Cys 


Arg Arg 


Val 


Lys 










260 










<210> 


75 












<211> 


585 












<212> 


PRT 












<213> 


Zea 


mays 








<400> 


75 










Met 


Ala 


Met 


Ala 


He 


Ser 


Ser 


Ala 



525 



Ala 


Lys 


Ala 


Ala 


Pro 


Ala 


Gin 


Glu 


Gly 


10 










15 




Glu 


Gin 


Lys 


Thr 


Arg 


His 


Ser 


25 










30 






Lys 


Ser 


Asp 


Asp 


Leu 


Tyr 


Gin 


Tyr 










45 






Arg 


Glu 


Pro 


Glu 


Ser 


Met 


Lys 


Glu 








60 








Pro 


Trp 


Asn 


Leu 


Met 


Thr 


Thr 


Ser 






75 










80 


Met 


Leu 


He 


Lys 


Leu 


He 


Gly 


Ala 




90 










95 




Tyr 


Thr 


Gly 


Tyr 


Ser 


Leu 


Leu 


Ala 


105 










110 






Gly 


Thr 


He 


Leu 


Ala 


Met 


Asp 


He 










125 






Leu 


Pro 


Cys 


He 
140 


Asn 


Lys 


Ala 


Gly 


Glu 


Gly 


Pro 


Ala 


Leu 


Pro 


Val 


Leu 




His 


155 










160 


Gin 


Gly 


Ser 


Phe 


Asp 


Phe 


Ala 




170 








175 




Tyr 


Leu 


Ser 


Tyr 


His 


Glu 


Arg 


Leu 


185 










190 




Leu 


He 


Gly 


Tyr 


Asp 


Asn 


Thr 


Leu 










205 








Asp Asp 


Ala 


Pro 


Met 


Arg 


Lys 


Tyr 








220 








Leu 


Ala 


Leu 


Asn 


Ser 


Ala 


Leu 


Ala 


Gin 




235 










240 


Leu 


Pro 


Val 


Gly 


Asp 


Gly 


val 




250 










255 




Leu 


Pro 


Cys 


Ser 


Leu 


Leu 


Val 


Ala 




10 










15 
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69 



Ala Leu Met Leu Leu Ala Ser Val Val Gin Val Gin Gly lie Thr Arg 

20 25 30 

His Tyr Asp Phe Asn Val Thr Met Ala Asn Val Thr Arg Leu Cys Ala 

35 40 45 

Ser Lys Ser lie lie Thr Val Asn Gly Gin Phe Pro Gly Pro Lys lie 

50 55 60 

Val Ala Arg Glu Gly Asp Arg Leu Val lie Arg Val Thr Asn His Ala 
65 70 75 80 

Gin His Asn lie Ser Xaa His Trp His Gly lie Arg Gin Leu Arg Thr 

85 90 95 

Gly Trp Ala Asp Gly Pro Ala Tyr lie Thr Gin Cys Pro lie Gin Thr 

100 105 110 

Gly Gin Ser Tyr Val Tyr Asn Tyr Thr Val Val Gly Gin Arg Gly Thr 

115 120 125 

Leu Trp Trp His Ala His lie Ser Trp Leu Arg Ala Thr Val Tyr Gly 

130 135 140 

Pro Leu Val lie Leu Pro Lys Leu Gly Val Pro Tyr Pro Phe Pro Ala 
145 150 155 160 

Pro Tyr Lys Glu Val Pro Val lie Phe Gly Glu Trp Trp Leu Ala Asp 

165 170 175 

Thr Glu Val Val lie Lys Gin Ala Leu Gin Leu Gly Ala Gly Pro Asn 

180 185 190 

Val Ser Asp Ala His Thr lie Asn Gly Leu Pro Trp Pro Leu Tyr Asn 

195 200 205 

Cys Ser Ala Lys Asp Thr Tyr Lys Leu Lys Val Lys Pro Gly Lys Thr 

210 215 220 

Tyr Met Leu Arg Leu lie Asn Ala Ala Leu Asn Asp Glu Leu Phe Phe 
225 230 235 240 

Ser Val Ala Asn His Ser Leu Thr Val Val Glu Val Asp Ala Val Tyr 

245 250 255 

Val Lys Pro Phe Thr Val Asp Thr Leu Leu lie Ala Pro Gly Gin Thr 

260 265 270 

Thr Asn Val Leu Leu Ala Ala Lys Pro Ser Tyr Pro Gly Ala Asn Tyr 

275 280 285 

Tyr Met Ser Ala Ala Pro Tyr Ser Thr Ala Arg Pro Ala Thr Phe Asp 

290 295 300 

Asn Thr Thr Val Ala Gly lie Leu Glu Tyr Glu Leu Tyr Pro Asp Ala 
305 310 315 320 

Pro Arg Pro Ser Ala Ser Ala Gly Ser Phe Asn Glu Ala Leu Pro Leu 

325 330 335 

Tyr Arg Pro Thr Leu Pro Gin Leu Asn Asp Thr Asn Phe Val Gly Asn 

340 345 350 

Phe Thr Ala Lys Leu Arg Ser Leu Ala Thr Pro Arg Tyr Pro Ala Ala 

355 360 365 

Val Pro Arg Thr Val Asp Arg Arg Phe Phe Phe Ala Val Gly Leu Gly 

370 375 380 

Thr His Pro Cys Pro Ala Asn Ala Thr Cys Gin Gly Pro Thr Asn Thr 
385 390 395 400 

Thr Gin Phe Ala Ala Ser Val Asn Asn Val Ser Phe Val Leu Pro Thr 

405 410 415 

Lys Ala Leu Leu His Ser His Phe Thr Gly Leu Ser Ser Gly Val Tyr 
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420 425 430 

Ser Pro Asp Phe Pro Val Ala Pro Leu Ala Pro Phe Asn Tyr Thr Gly 

435 440 445 

Thr Pro Pro Asn Asn Thr Asn Val Ala Ser Gly Thr Lys Leu Met Val 

450 455 460 

Val Pro Tyr Gly Ala Asn Val Glu Leu Val Met Gin Gly Thr Ser lie 
465 470 475 480 

Leu Gly Val Glu Ser His Pro Leu His Leu His Gly Phe Asn Phe Phe 

485 490 495 

Val Val Gly Gin Gly Tyr Gly Asn Tyr Asp Pro Val Asn Asp Pro Ser 

500 505 510 

Lys Phe Asn Leu Val Asp Pro Val Glu Arg Asn Thr Val Gly Val Pro 

515 520 525 

Ala Gly Gly Trp Val Ala lie Arg Phe Leu Ala Asp Asn Pro Gly Val 

530 535 540 

Trp Phe Met His Cys His Leu Glu Ala His Thr Thr Trp Gly Leu Arg 
545 550 555 ' 560 

Met Ala Trp Leu Val Leu Asp Gly Ser Leu Pro His Gin Lys Leu Leu 

565 570 575 

Pro Pro Pro Ser Asp Leu Pro Lys Cys 
580 585 

<210> 76 

<211> 1859 

<212> DNA 

<213> Zea mays 

<220> 
<221> CDS 

<222> (79) . . . (1656) 
<400> 76 

gtcgacccac gcgtccgcta aaacccaaac gaaataaaag gaagaggtca aaaaataaaa 60 
ggtgttgtgc aatcgatc atg gtg acc gtg gcc aag ate gec atg gag tgg ill 

Met Val Thr Val Ala Lys lie Ala Met Glu Trp 
1 5 10 

etc caa gac cct ctg age tgg gtg ttc ctg ggc acg ctg gcc ttg gtg 159 
Leu Gin Asp Pro Leu Ser Trp Val Phe Leu Gly Thr Leu Ala Leu Val 
15 20 25 

gtc ctg cag ctg cga cga egg ggc aaa gcg ccg ctg ccg ccc ggg ccg 207 
Val Leu Gin Leu Arg Arg Arg Gly Lys Ala Pro Leu Pro Pro Gly Pro 
30 35 40 

aag ccg ctg ccg ate gtg ggc aac atg gcg atg atg gac cag ctg acc 255 
Lys Pro Leu Pro lie Val Gly Asn Met Ala Met Met Asp Gin Leu Thr 
45 50 55 

cac cgc ggg ctg gcg gcg ctg gcc gag agg tac ggc ggg ctg ctg cac 303 
His Arg Gly Leu Ala Ala Leu Ala Glu Arg Tyr Gly Gly Leu Leu His 
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60 65 70 75 

etc cgc ctg ggc egg ctg cac gcg ttc gcg gtg teg acg ccc gag tac 351 
Leu Arg Leu Gly Arg Leu His Ala Phe Ala Val Ser Thr Pro Glu Tvr 
80 85 90 

gcg cgc gag gtg ctg cag gcg cag gac ggc gcg ttc teg aac egg ccg 399 
Ala Arg Glu Val Leu Gin Ala Gin Asp Gly Ala Phe Ser Asn Arq Pro 
95 ioo 105 

gee act ate gee ate gcg tac ctg acg tac gac cgc gec aac atcr aca aa7 
Ala Thr lie Ala He Ala Tyr Leu Thr Tyr Asp Arg Ala Asp Met £2 
110 115 12 o 

ttc gcg cac tac ggg ccc ttc tgg cgc cag atg cgc aag ctg tgc gtq 495 
Phe Ala His Tyr Gly Pro Phe Trp Arg Gin Met Arg Lyi Leu Cys Val 
125 130 135 

atg aag ctg ttc age egg cgc cgc gee gag acg tgg gtg gee gtg cgc 54 3 

Met Lys Leu Phe Ser Arg Arg Arg Ala Glu Thr Trp Val Ala Val Ara 
140 145 150 15 5 

gac gag tgc gcg gcg ctg gtc cgc gee gtg gcg tec ggc ggc ggc ggc 591 
Asp Glu Cys Ala Ala Leu Val Arg Ala Val Ala Ser Gly Gly Gly Gly 
160 165 170 

ggc ggc gag gee gtg aac ctg ggc gag etc ate ttc aac ctg acc aag 639 
Gly Gly Glu Ala Val Asn Leu Gly Glu Leu He Phe Asn Leu Thr Lys 
175 180 i 8 5 

? a ° S*? ^ g t w C CgC gcc gcc ttc gac acc cac aac TOG gag gac cag 687 
Asn Val Thr Phe Arg Ala Ala Phe Gly Thr Arg Asp Gly Glu Asp Gin 
190 195 200 

gag gag ttc ate gcc ate ctg cag gag ttc teg aag ctg ttc ggc gcc 73 5 

Glu Glu Phe He Ala He Leu Gin Glu Phe Ser Lys Leu Phe Gly Ala 
205 210 215 

ttc aac gtc gtc gac ttc ctg ccg tgg ctg age tgg atg gac ctg cag 783 
Phe Asn Val Val Asp Phe Leu Pro Trp Leu Ser Trp Met Asp Leu Gin 
220 225 230 235 

ggc ate aac cgc cgc etc cgc gcc gca cga tec gcg ctg gac egg ttc 831 
Gly lie Asn Arg Arg Leu Arg Ala Ala Arg Ser Ala Leu Asp Arg Phe 
240 245 250 

ate gac aag ate ate gac gag cac gtg agg egg ggg aag aac ccc gac 879 
He Asp Lys He He Asp Glu His Val Arg Arg Gly Lys Asn Pro Asp 
25 5 260 265 

gac gcc gac gcc gac atg gtc gac gac atg etc gcc ttc ttc gcc gag 927 
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Asp Ala 


Asp 
270 


Ala 


Asp 


Met 


Val 


Asp 
275 


Asp 


Met 


Leu 


Ala 


Phe 
280 


Phe 


Ala 


Glu 




gcc aag 
Ala Lys 
285 


ccg 
Pro 


ccc 
Pro 


aag 
Lys 


aag 
Lys 


ggg 

Gly 
290 


ccc 
Pro 


gcc 
Ala 


gcc 
Ala 


gcc 
Ala 


gcg 
Ala 
295 


gac 
Asp 


ggt 
Gly 


gac 
Asp 


gac 
Asp 


975 


ctg cac 
Leu His 
300 


aac 
Asn 


ace 
Thr 


etc 
Leu 


egg 
Arg 
305 


etc 
Leu 


acg 
Thr 


cgc 
Arg 


gac 
Asp 


aat 
Asn 
310 


ate 
He 


aag 
Lys 


get 
Ala 


ate 
He 


ate 
He 
315 


1023 


atg gac gtg 
Met Asp Val 


atg 
Met 


ttt 
Phe 
320 


ggc 
Gly 


ggg 

Gly 


acg 
Thr 


gag 
Glu 


acg 
Thr 
325 


gtg 
Val 


gcg 
Ala 


teg 
Ser 


gcg 
Ala 


ate 
He 
330 


gag 
Glu 


1071 


tgg gcg 
Trp Ala 


atg 
Met 


gcg 
Ala 
335 


gag 
Glu 


atg 
Met 


atg 
Met 


cac 
His 


age 
Ser 
340 


ccc 
Pro 


gac 
Asp 


gac 
Asp 


ctg 
Leu 


cgc 
Arg 
345 


egg 
Arg 


ctg 
Leu 


1119 


cag cag 
Gin Gin 


gag 
Glu 
350 


etc 
Leu 


gcc 
Ala 


gac 
Asp 


gtc 
Val 


gtg 
Val 
355 


ggc 
Gly 


ctg 
Leu 


gac egg 
Asp Arg 


aac 
Asn 
360 


gtg 
Val 


aac 
Asn 


gag 
Glu 


1167 


teg gac 
Ser Asp 
365 


ctg 
Leu 


gac 
Asp 


aag 
Lys 


etc 
Leu 


ccc 
Pro 
370 


ttc 
Phe 


etc 
Leu 


aag 
Lys 


tgc 
Cys 


gtc 
Val 
375 


ate 
He 


aag 
Lys 


gag 
Glu 


acg 
Thr 


1215 


etc egg ctg 
Leu Arg Leu 
380 


cac 
His 


ccg 
Pro 


ccg 
Pro 
385 


ate 
He 


ccg 
Pro 


ctg 
Leu 


etc 
Leu 


ctg 
Leu 
390 


cac 
His 


gag 
Glu 


ace 
Thr 


gcc 
Ala 


ggc 

Gly 
395 


1263 


gac tgc 
Asp Cys 


gtc 
Val 


gtg 
Val 


ggc 
Gly 
400 


ggc 
Gly 


tac 
Tyr 


tec 
Ser 


gtg 
Val 


ccc 
Pro 
405 


agg 
Arg 


ggc 
Gly 


tec 
Ser 


cgc 
Arg 


gtc 
Val 
410 


atg 
Met 


1311 


gtc aac 
Val Asn 


gtg 
Val 


tgg 
Trp 
415 


gcc 
Ala 


ate 
lie 


ggc 
Gly 


cgc 
Arg 


cac 
His 
420 


cgc 
Arg 


gcc 
Ala 


teg 
Ser 


tgg 
Trp 


aag 
Lys 
425 


gac 
Asp 


gcc 
Ala 


1359 


gac gcg 
Asp Ala 


ttc 
Phe 
430 


egg 
Arg 


ccg 
Pro 


teg 
Ser 


cgc 
Arg 


ttc 
Phe 
435 


acg 
Thr 


ccc 
Pro 


gag 
Glu 


ggc 
Gly 


gag 
Glu 
440 


gcc 
Ala 


gcg 
Ala 


ggg 

Gly 


1407 


etc gac 
Leu Asp 
445 


ttc 
Phe 


aag 
Lys 


ggc 
Gly 


ggc 
Gly 


tgc 
Cys 
450 


ttc 
Phe 


gag 
Glu 


ttc 
Phe 


ctg 
Leu 


ccc 
Pro 
455 


ttc 
Phe 


ggc 
Gly 


tec 
Ser 


ggc 
Gly 


1455 


cgc cgc 
Arg Arg 
460 


teg 
Ser 


tgc 
Cys 


ccc 
Pro 


ggc 
Gly 
465 


acg 
Thr 


gcg 
Ala 


ctg 
Leu 


ggc 
Gly 


ctg 
Leu 
470 


tac 
Tyr 


gcg 
Ala 


ctg 
Leu 


gag 
Glu 


etc 
Leu 
475 


1503 
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gcc gtc gcc cag etc gcg cac ggc ttc aac tgg teg ctg ccc gac ggc 1551 

Ala Val Ala Gin Leu Ala His Gly Phe Asn Trp Ser Leu Pro Asp Gly 
480 485 490 

atg aag ccc teg gag ctg gac atg ggc gac gtc ttc ggc etc ace acq 1599 
Met Lys Pro Ser Glu Leu Asp Met Gly Asp Val Phe Gly Leu Thr Ala 
495 500 505 

ccg cgc gcc acg agg etc tac gcc gtg cct acg ccc egg etc aac tgc 1647 
Pro Arg Ala Thr Arg Leu Tyr Ala Val Pro Thr Pro Arg Leu Asn Cvs 
510 515 520 

ccc ttg tac tgacgccatg cgcgggcgac tgccattacc atcgtcccct 1696 
Pro Leu Tyr 
525 

cgggtgggtg tggggtaegg gggtaggagt ttggtgcctt tctctgtcgt cttttttccc 1756 

tttaaaaaac atgcctggtc gatgttgtag ggtgtgttgt agacagecat tatcaatttt 1816 

ttttattctc aaaaaaaaaa aaaaaaaaaa aaagggegge cgc 1859 

<210> 77 
<211> 1218 
<212> DNA 
<213> Zea mays 

<220> 
<221> CDS 

<222> (112) . . . (900) 
<400> 77 

gtcgacccac gegtccgata cccgacgcgc aaccagtgcc gcacccagac cagatctccg 60 
cgacatatca gtcgttcgtc cagctaactg cactgcactg cactgcacgc a atg gcc 117 

Met Ala 
1 

ace acg gcg ace gag gcg gcc aag get gca ccg gcg cag gag cag cag 165 
Thr Thr Ala Thr Glu Ala Ala Lys Ala Ala Pro Ala Gin Glu Gin Gin 
5 10 15 

gcc aac ggc aac ggc aac ggc gag cag aag acg cgc cac tec gag gtc 213 
Ala Asn Gly Asn Gly Asn Gly Glu Gin Lys Thr Arg His Ser Glu Val 
20 25 30 



ggc cac aag age ctg etc aag age gac gac ctg tac cag tac ate ctg 
Gly Hxs Lys Ser Leu Leu Lys Ser Asp Asp Leu Tyr Gin Tyr lie Leu 
35 40 45 ' 50 

gac acg age gtg tac ccg egg gag ccg gag age atg aag gag ctg cgc 
Asp Thr Ser Val Tyr Pro Arg Glu Pro Glu Ser Met Lys Glu Leu Arg 
55 60 65 



261 



309 



357 



405 
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gag ate acc gec aag cac cca tgg aac ctg atg acc acc tec gec gac 
Glu lie Thr Ala Lys His Pro Trp Asn Leu Met Thr Thr Ser Ala Asp 
70 75 80 

gag ggc cag ttc etc aac atg etc ate aag etc ate ggc gee aag aaq 
Glu Gly Gin Phe Leu Asn Met Leu He Lys Leu He Gly Ala Lys Lys 
85 90 95 

acc atg gag ate ggc gtc tac acc ggc tac teg etc etc gee acc gcg 453 
Thr Met Glu He Gly Val Tyr Thr Gly Tyr Ser Leu Leu Ala Thr Ala 
100 105 no 

etc gca etc ccg gag gac ggc acg ate ttg gee atg gac ate aac cgc 501 
Leu Ala Leu Pro Glu Asp Gly Thr He Leu Ala Met Asp He Asn Ara 
115 120 125 130 

gag aac tac gag eta ggc ctt ccc tgc ate aac aag gee ggc gtg gac 549 
Glu Asn Tyr Glu Leu Gly Leu Pro Cys He Asn Lys Ala Gly Val Gly 
135 140 145 

cac aag ate gac ttc cgc gag ggc ccc gcg etc ccc gtc ctg gac gac 597 
His Lys He Asp Phe Arg Glu Gly Pro Ala Leu Pro Val Leu Asp Asp 
150 155 160 

etc gtg gcg gac aag gag cag cac ggg teg ttc gac ttc gee ttc gtg 645 
Leu Val Ala Asp Lys Glu Gin His Gly Ser Phe Asp Phe Ala Phe Val 
165 170 175 

gac gee gac aag gac aac tac etc age tac cac gag egg etc ctg aag 693 
Asp Ala Asp Lys Asp Asn Tyr Leu Ser Tyr His Glu Arg Leu Leu Lys 
180 185 190 

ctg gtg agg ccc ggc ggc etc ate ggc tac gac aac acg ctg tgg aac 741 
Leu Val Arg Pro Gly Gly Leu He Gly Tyr Asp Asn Thr Leu Trp Asn 
195 200 205 210 

ggc tec gtc gtg etc ccc gac gac gcg ccc atg cgc aag tac ate cgc 789 
Gly Ser Val Val Leu Pro Asp Asp Ala Pro Met Arg Lys Tyr He Arg 
215 220 225 

ttc tac cgc gac ttc gtc etc gee etc aac age gcg etc gee gec gac 
Phe Tyr Arg Asp Phe Val Leu Ala Leu Asn Ser Ala Leu Ala Ala Asp 
230 235 240 



837 



gac cgc gtc gag ate tgc cag etc ccc gtc ggc gac ggc gtc acg etc 885 
Asp Arg Val Glu He Cys Gin Leu Pro Val Gly Asp Gly Val Thr Leu 
245 250 255 

tgc cgc cgc gtc aag tgaaaaaaag aagaagaaga aaaaaaacat aataccctgc 940 
Cys Arg Arg Val Lys 
260 
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gttcctgctg ccccggctgt ctggccccca ctactgccac cgacggcggc gccgaacccc 1000 

cgttccaatc atcatatcgt agacgacgcg cagcattaaa ctatcaatca ccggatctgg 1060 

ctctttcttg gccctgtact gtactattaa tgttccgttc ttgttttttt attcggaatt 1120 

gtcgccgttt cagtatacgt aaatctcgag gtcgataata cagtaatact accaatttaa 1180 

ctgtataaaa aaaaaaaaaa aaaaaaaagg gcggccgc 1218 

<210> 78 
<211> 2230 
<212> DNA 
<213> Zea mays 

<220> 
<221> CDS 

<222> (170) ... (1924) 
<400> 78 

tggtacgcnt gcaggtaccg gtccggantt cccgggtcga cccacgcgtc cgcattattc 60 
ttcctccacc catctcttga gtcgcctggc cggccgccgt cttggttccc ctctagagct 120 
caacagcaga gcagctgtgt agcatagagc gaggtttaac catcagcgc atg gcc atg 178 

Met Ala Met 
1 

gcg ate tec tct get ctt ccg tgc tec etc etc gtg gcg gcc ctg atg 2 26 

Ala lie Ser Ser Ala Leu Pro Cys Ser Leu Leu Val Ala Ala Leu Met 
5 10 15 

etc etc gcc tec gtc gtc caa gtg caa ggc ate acg agg cac tac gac 274 
Leu Leu Ala Ser Val Val Gin Val Gin Gly lie Thr Arg His Tyr Asp 
20 25 30 35 

ttc aat gtg ace atg gcg aac gtg aca egg ctg tgc gcc age aag age 3 22 

Phe Asn Val Thr Met Ala Asn Val Thr Arg Leu Cys Ala Ser Lys Ser 
40 45 50 

ate ate acg gtg aac ggg cag ttc ccc ggg ccc aag ate gtg gcg agg 370 
lie He Thr Val Asn Gly Gin Phe Pro Gly Pro Lys He Val Ala Arg 
55 60 65 

gaa ggc gac egg etc gtc ate cgc gtc ace aac cac gcc cag cac aac 418 
Glu Gly Asp Arg Leu Val He Arg Val Thr Asn His Ala Gin His Asn 
70 75 80 

ate teg ntg cac tgg cac ggc ate egg cag ctg cgc acg ggg tgg gcg 466 
He Ser Xaa His Trp His Gly He Arg Gin Leu Arg Thr Gly Trp Ala 
85 90 95 

gac ggg ccg gcg tac ate acg cag tgc ccg ate cag acg ggg cag agt 514 
Asp Gly Pro Ala Tyr He Thr Gin Cys Pro He Gin Thr Gly Gin Ser 
100 105 no H5 



WO 99/10498 PCT/US98/17519 

76 

tac gtg tac aac tac acc gtc gtg ggg cag cgc ggc acg ctg tgg tgg 56 2 

Tyr Val Tyr Asn Tyr Thr Val Val Gly Gin Arg Gly Thr Leu Trp Trp 
120 125 130 

cac gcg cac ate tec tgg ctg cgc gec acc gtc tac ggg ccc etc gtc 610 

His Ala His lie Ser Trp Leu Arg Ala Thr Val Tyr Gly Pro Leu Val 

135 140 " 145 

ate ctg ccc aag etc ggc gtc ccc tac ccg ttc ccg gcg ccc tac aag 658 

lie Leu. Pro Lys Leu Gly Val Pro Tyr Pro Phe Pro Ala Pro Tyr Lys 

150 155 160 

gag gtc ccc gtc ate ttc ggt gag tgg tgg ctg gcg gac acg gag gtg 706 

Glu Val Pro Val lie Phe Gly Glu Trp Trp Leu Ala Asp Thr Glu Val 

165 170 175 

gtg ate aag cag gcg ctt cag etc ggc get ggc ccc aat gtc tct gac 754 

Val lie Lys Gin Ala Leu Gin Leu Gly Ala Gly Pro Asn Val Ser Asp 
180 185 190 195 

gee cac acc ate aac ggc ctg cca tgg ccg etc tac aac tgc tct gec 802 

Ala His Thr lie Asn Gly Leu Pro Trp Pro Leu Tyr Asn Cys Ser Ala 
200 205 " 210 

aaa gac acg tac aag ctg aag gtg aag ccc ggg aag acg tac atg ctg 8 50 

Lys Asp Thr Tyr Lys Leu Lys Val Lys Pro Gly Lys Thr Tyr Met Leu 

215 220 225 

cgc etc ate aac gcg gcg etc aac gac gag etc ttc ttc tec gtc gec 898 

Arg Leu lie Asn Ala Ala Leu Asn Asp Glu Leu Phe Phe Ser Val Ala 

230 235 240 

aac cac teg etc acg gtc gtc gag gtc gac gee gtc tac gtc aag ccc 946 

Asn His Ser Leu Thr Val Val Glu Val Asp Ala Val Tyr Val Lys Pro 

245 250 255 

ttc acc gtc gac acg ctg etc ate gcg ccg ggc cag acc acc aac gtg 994 

Phe Thr Val Asp Thr Leu Leu lie Ala Pro Gly Gin Thr Thr Asn Val 
260 265 270 275 



ctg etc gee gec aag ccg tec tac ccg ggc gee aac tac tac atg tec 
Leu Leu Ala Ala Lys Pro Ser Tyr Pro Gly Ala Asn Tyr Tyr Met Ser 
280 285 ^ 290 



1042 



gec gcg ccc tac tec acc gee agg ccg gec acc ttc gac aac acc acc 1090 
Ala Ala Pro Tyr Ser Thr Ala Arg Pro Ala Thr Phe Asp Asn Thr Thr 
295 300 305 

gtc gee ggc ate etc gag tac gag ctg tac ccc gac gcg ccc egg ccc 113 8 

Val Ala Gly lie Leu Glu Tyr Glu Leu Tyr Pro Asp Ala Pro Arg Pro 
310 315 320 
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tec gec tec gcg ggg age ttc aac gag gee ctg ccg etc tac aga ccg 1186 
Ser Ala Ser Ala Gly Ser Phe Asn Glu Ala Leu Pro Leu Tyr Arg Pro 
325 330 335 

acc ctg ccg cag etc aac gac acc aac ttc gtc ggc aac ttc acg gee 1234 
Thr Leu Pro Gin Leu Asn Asp Thr Asn Phe Val Gly Asn Phe Thr Ala 
340 345 350 355 

aag etc cgc age etc gcg acg ccg egg tac ccg gcg gee gtg ccg egg 1282 
Lys Leu Arg Ser Leu Ala Thr Pro Arg Tyr Pro Ala Ala Val Pro Arg 
360 365 370 

acg gtg gac agg egg ttc ttc ttc gcg gtc ggg etc ggc acg cac ccg 1330 
Thr Val Asp Arg Arg Phe Phe Phe Ala Val Gly Leu Gly Thr His Pro 
375 380 385 

tgc ccc gee aac gee acg tgc cag ggc ccc acc aac acc acg cag ttc 1378 
Cys Pro Ala Asn Ala Thr Cys Gin Gly Pro Thr Asn Thr Thr Gin Phe 
390 395 400 

gcg gcg tec gtc aac aac gtc tec ttc gtg etc ccc acc aag gcg ctg 1426 
Ala Ala Ser Val Asn Asn Val Ser Phe Val Leu Pro Thr Lys Ala Leu 
405 410 415 

ctg cac tec cac ttc acc ggc ctg tec age ggc gtc tac teg ccg gac 1474 
Leu His Ser His Phe Thr Gly Leu Ser Ser Gly Val Tyr Ser Pro Asp 
420 425 430 435 

ttc ccc gtc gcg ccc ctg gcg ccg ttc aac tac acg ggg acg ccg ccc 1522 
Phe Pro Val Ala Pro Leu Ala Pro Phe Asn Tyr Thr Gly Thr Pro Pro 
440 445 " 450 

aac aac acc aac gtg gee age ggg acc aag etc atg gtc gtc ccg tac 1570 
Asn Asn Thr Asn Val Ala Ser Gly Thr Lys Leu Met Val Val Pro Tyr 
455 460 * 465 

ggc gee aac gtg gag etc gtc atg cag ggc acc age ate etc ggc gtc 1618 
Gly Ala Asn Val Glu Leu Val Met Gin Gly Thr Ser He Leu Gly Val 
470 475 480 

gag age cac ccg ctg cac ctg cac ggc ttc aac ttc ttc gtg gtc ggc 1666 
Glu Ser His Pro Leu His Leu His Gly Phe Asn Phe Phe Val Val Gly 
485 490 495 

caa ggg tac ggc aac tac gac ccc gtc aac gac ccg tec aag ttc aac 1714 
Gin Gly Tyr Gly Asn Tyr Asp Pro Val Asn Asp Pro Ser Lys Phe Asn 
500 505 510 515 

etc gtc gac ccc gtc gag cgc aac acc gtc ggc gtg ccg gee ggc gga 1762 
Leu Val Asp Pro Val Glu Arg Asn Thr Val Gly Val Pro Ala Gly Gly 
520 525 530 

tgg gtg gee ate cgc ttc etc gec gac aac ccc ggg gtc tgg ttc atg 1810 
Trp Val Ala He Arg Phe Leu Ala Asp Asn Pro Gly Val Trp Phe Met 
535 540 545 
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cat tgc cat ttg gag gcg cac aca aca tgg ggc etc agg atg gca tgg 18 58 

His Cys His Leu Glu Ala His Thr Thr Trp Gly Leu Arg Met Ala Trp 
550 555 560 

ttg gtg etc gac ggc age etc ccg cac cag aag ctg etc ccg ccg ccg 1906 
Leu Val Leu Asp Gly Ser Leu Pro His Gin Lys Leu Leu Pro Pro Pro 
565 570 575 

tea gac tta ccc aaa tgt tgattagact cttcctctat ctctatcctg 1954 
Ser Asp Leu Pro Lys Cys 
580 585 

ccggtcgctt caaattaaag ggaatgtgaa ttagacaaat gtttgtttgt ttttttgttt 2014 

actttcttca ttgccaattg caattttttc aacttgeatt ttaactagtc cgttccgttc 2074 

ctagctgacc tggacttttt tgtaattttt ttcttccatt tgtttgccac cacaaatgtt 2134 

tttgtacact cctctgaaaa taaagaatgg cgtgacttgc accagataaa aaaaaaaaaa 2194 

aaaaaaaaaa aaaaaaaaaa aaaaaagggc ggcege 223 0 

<210> 79 
<211> 25 
<212> DNA 
<213> Zea mays 

<400> 79 

atggtgaccg tggecaagat cgeca 2 5 

<210> 80 
<211> 25 
<212> DNA 
<213> Zea mays 

<400> 80 

tcagtacaag gggcagttga geegg 2 5 

<210> 81 
<211> 25 
<212> DNA 
<213> Zea mays 

<400> 81 

atggccacca cggcgaccga ggegg 2 5 

<210> 82 
<211> 25 
<212> DNA 
<213> Zea mays 

<400> 82 

tcacttgacg eggeggcaga gcgtg 2 5 

<210> 83 
<211> 25 
<212>. DNA 
<213> Zea mays 
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<400> 83 
atggccatgg cgatctcctc tgctc 

<210> 84 
<211> 25 
<212> DNA 
<213> Zea mays 



<400> 84 
tcaacatttg ggtaagtctg acggc 
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